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DEDICATION 


Professor  S  Ramaseshan  attained  the  age  of  60  on  October  10,  1983. 
Ramaseshan  is  known  to  the  scientific  community  in  India  not  only  for  his 
scientific  contributions,  but  also  for  his  dedicated  service  to  this  Academy.  He 
has  served  the  Academy  in  various  capacities  for  the  past  several  years.  His 
contribution  to  the  Academy,  especially  as  Editor  of  Publications  and  as  the 
Chief  Editor  of  Pramana  (a  journal  of  physics),  have  been  invaluable.  There  is 
little  doubt  that  much  of  the  credit  for  strengthening  and  enlarging  the 
publications  activity  of  the  Academy  goes  to  Ramaseshan.  He  is  now  the 
President  of  the  Academy. 

Sivaraj  Ramaseshan  was  born  in  Calcutta  and  had  his  early  education  at 
Nagpur,  obtaining  his  B.Sc(Hons)  degree  in  1943.  He  joined  the  Indian 
Institute  of  Science  soon  after  to  carry  out  research  work  with  Professor 
Sir  C  V  Raman  and  obtained  the  Associateship  of  the  Institute  in  1 948  and 
the  D.Sc.,  degree  from  the  University  of  Nagpur  in  1 951 .  He  was  a  member  of 
the  faculty  of  the  Physics  Department  of  the  Indian  Institute  of  Science 
between  1948  and  1962  and  then  moved  to  the  Indian  Institute  of 
Technology,  Madras,  as  head  of  the  Physics  Department.  He  returned  to 
Bangalore  in  1 966  to  found  a  new  division  devoted  to  materials  science  in  the 
National  Aeronautical  Laboratory.  The  Materials  Science  Division  of  this 
laboratory  soon  became  well  known  as  an  outstanding  research  centre.  He 
returned  to  the  Indian  Institute  of  Science  as  Joint  Director  in  1979  and 
became  Director  in  1 981 . 

Ramaseshan  has  contributed  to  a  variety  of  areas  in  Crystallography, 
Physics  of  Solids,  Optics  and  Materials  Science.  His  sharp  intellectand  clarity 
of  understanding  can  be  readily  seen  from  his  varied  research  publications. 
During  the  early  years  of  his  research  career,  Ramaseshan  carried  out  studies 
on  the  crystallography  of  diamond  and  magneto-optic  rotation.  He  evolved 
criteria  which  determine  clevage  planes  in  diamond  and  based  on  surface 
energies  of  crystallographic  planes,  he  could  explain  many  characteristic 
features  of  the  curved  morphology  of  Indian  diamonds.  He  then  developed  a 
model  to  calculate  the  actual  Faraday  rotation  from  the  measured,  apparent 
rotation.  He  studied  the  effect  of  stress  on  optical  activity  where  he  showed 
that  the  piezo-rotation  tensor  was  an  actual  tensor  of  the  fourth  rank.  His 
interest  in  Faraday  rotation  and  birefringence  led  him  to  study  the  concept  of 
Poincare  representation  of  polarized  light.  Ramaseshan  later  investigated 
optical  and  magneto-optical  activity  in  x-rays  and  neutrons  and  the 
Borrmann  effect  in  optics. 

One  of  the  major  contributions  of  Ramaseshan  has  been  the  application  of 
anomalous  scattering  of  x-rays  and  neutrons  to  crystal  structure  analysis.  This 


technique  has  been  used  by  several  workers  and  has  become  more 
meaningful  with  the  availability  of  synchrotron  radiation.  Ramaseshan  and 
his  students  have  studied  crystal  structures  of  several  important  organic  and 
inorganic  systems  and  a  number  of  leading  crystallographers  in  the  country 
were  trained  in  his  laboratory.  After  moving  to  the  National  Aeronautical 
Laboratory,  Ramaseshan  and  his  co-workers  investigated  a  variety  of 
fundamental  and  applied  problems.  A  new  high  pressure  laboratory  was  set 
up  at  the  National  Aeronautical  Laboratory  where  several  interesting  phase 
transitions  were  investigated.  A  compressible  ion  theory  of  ionic  crystals  was 
developed  to  explain  structural  stabilities  and  phase  transitions  of  a  number 
of  systems.  Under  his  guidance,  the  materials  science  division  contributed 
much  to  the  development  of  new  techniques,  devices  and  materials. 

Professor  Ramaseshan  was  elected  a  Fellow  of  the  Indian  Academy  of 
Sciences  in  1955  at  the  age  of  32.  He  was  elected  Fellow  of  the  Indian 
National  Science  Academy  in  1972.  He  has  received  the  Shanti  Swarup 
Bhatnagar  award  of  the  CSIR  in  Physical  Sciences  and  the  Vasvik  award  in 
Materials  Science.  He  was  a  Jawaharlal  Nehru  Fellow  during  1977-1979. 

Sivaraj  has  endeared  himself  to  the  scientific  community  by  his  fine 
disposition  and  sense  of  humour.  He  has  always  been  most  considerate  and 
helpful  to  his  colleagues.  Sivaraj  has  had  the  invaluable  support  of  his 
charming  wife,  Kausalya,  and  their  three  fine  daughters  who  undoubtedly 
have  provided  much  happiness  and  encouragement. 

On  the  occasion  of  his  attaining  the  age  of  60,  it  is  a  matter  of  great  pleasure 
to  convey  the  best  wishes  of  the  fellows  of  the  Academy  and  other  members 
of  the  scientific  community,  for  many  years  of  creative  scientific  activity  and 
happy  life. 

To  mark  this  occasion,  we  have  brought  out  this  volume  containing  papers 
on  crystallography  published  in  the  special  number  (Vol.  92  Numbers  4  and 
5)  of  the  Proceedings  of  the  Indian  Academy  of  Sciences  (Chemical 
Sciences). 


C  N  R  Rao 
Vice-President  and 
Editor  of  Publications 
Indian  Academy  of  Sciences 


Bangalore 
October  1 983 


SIVARAJ  RAMASESHAN 


He  still  looks  young  and  is  so  unassuming  and  easy  to  talk  with  that  the  young 
visitors  in  my  home  who  met  him  recently  found  it  almost  impossible  to 
believe  that  he  was  nearly  sixty,  the  Director  of  a  great  institute,  one  of  the 
most  important  scientists  in  India. 

In  Oxford,  we  first  began  to  know  something  of  his  qualities  when  he  spent 
a  year  in  the  crystallographic  laboratory  in  1963  -64.  It  might  have  been 
a  difficult  year  for  my  research  group,  I  was  a  great  deal  abroad.  In  fact,  it 
was  a  marvellous  experience  for  them,  being  guided  instead  by  Sivaraj 
Ramaseshan,  with  his  great  learning  and  his  critical  understanding. 
Complicated  structures  were  solved  under  his  hands,  mistakes  put  right,  new 
approaches  to  structure  analysis  developed.  It  was  not  surprising  to  find  that 
the  whole  group  had  become  his  devoted  friends  and  admirers — and  had 
developed  great  respect  for  Indian  science. 

We  were  surprised  to  learn  that  soon  after  his  return  to  India  he  had  becorrw 
a  Director  of  research  not  in  an  academic  institute  but  in  the  Aeronautics 
Research  Laboratory,  to  find  him  beginning  a  wholly  new  line  of  work,  seated 
in  a  desk  in  one  corner  of  a  large  empty  room  with  one  assistant  seated  in  the 
opposite  corner.  That  the  research  laboratory  grew  rapidly,  that  its  work  was 
connected  with  real  life,  that  it  introduced  many  new  productive  lines  in 
related  industries  is  now  well  known,  past  history. 

He  changed  course  again,  needing  a  time  to  think,  he  began  to  work  on  the 
past,  on  the  life  of  C.  V.  Raman,  discovering  new  and  fascinating  aspects  of 
Raman's  researches. 

And  now  in  his  last  few  working  years,  his  very  varied  experiences  have 
enabled  Sivaraj  Ramaseshan  to  make  still  more  fundamental  contributions 
himself  to  the  development  of  science  in  India.  In  his  most  recent  work  as 
Director  of  the  Indian  Institute  of  Science,  he  has  not  only  created  a  happy 
and  fruitful  present  but  has  laid  the  foundation  for  a  future  of  great  promise. 

I  think  of  Sivaraj  Ramaseshan  as  one  whose  influence  is  extraordinarily 
creative,  who  has  the  power  to  change  the  lives  of  people  and  nations.  His 
own  life  was  greatly  affected  by  his  meeting  when  young,  with  Gandhi. 
Something  of  Gandhi's  insight,  of  his  care  for  people  as  individuals,  influence 
all  his  actions. 

I  should  add,  he  belongs  to  the  world,  to  his  friends  in  many  countries,  to 
international  science,  to  everyone's  hopes  for  the  future. 

We  all  greet  him  on  his  birthday  with  great  affection  and  admiration  and 
wish  him  every  happiness,  surrounded  by  his  family,  and  we  look  forward  to 
many  new  beginnings  in  his  life's  work. 


Crab  Mill 
llmington 


Dorothy  Hodgkin 
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The  phase  problem  of  x-ray  crystallography 


HERBERT  HAUPTMAN 

Medical  Foundation  of  Buffalo,  73  High  Street,  Buffalo,  New  York  14203,  USA 

Abstract.  Three  major  themes  are  developed; 

(i)  The  fundamental  principle  of  direct  methods:  The  structure  seminvariants  link  the 
observed  magnitudes  |£|  with  the  desired  phases  </>  of  the  normalized  structure  factors  E. 
Specifically,  for  fixed  enantiomorph,  the  observed  magnitudes  |£|  determine,  in  general, 
unique  values  for  all  the  structure  seminvariants;  the  latter,  in  turn,  as  certain  well  defined 
linear  combinations  of  the  phases,  lead  to  unique  values  for  the  individual  phases  4>- 

(ii)  The  neighbourhood  principle:  For  fixed  enantiomorph,  the  value  of  any  structure 
seminvariant  T  is  primarily  determined,  in  favourable  cases,  by  the  values  of  one  or  more  small 
sets  of  magnitudes  |£|,  the  neighbourhoods  of  £,  and  is  relatively  insensitive  to  the  great  bulk 
of  remaining  magnitudes.  The  conditional  probability  distribution  of  7",  given  the  magnitudes 
in  any  of  its  neighbourhoods,  yields  an  estimate  for  T  which  is  particularly  good  in  the 
favourable  case  that  the  variance  of  the  distribution  happens  to  be  small. 

(iii)  The  extension  concept:  By  embedding  the  structure  seminvariant  T  and  its  symmetry 
related  variants  in  suitable  structure  invariants  Q,  one  obtains  the  extensions  Q  of  the 
seminvariant  r.  In  this  way  the  probabilistic  theory  of  the  structure  seminvariants  is  reduced  to 
that  of  the  structure  invariants,  which  is  well  developed. 

Finally,  the  traditional  techniques  of  direct  methods  are  combined  with  isomorphous 
replacement  and  anomalous  dispersion. 

Keywords.  Direct  methods;  invariant;  seminvariant;  distribution;  neighbourhood;  extension. 


1.  Introduction 

1.1  The  phase  problem 

The  intensities  of  a  sufficient  number  of  x-ray  diffraction  maxima  determine  a  crystal 
structure.  The  available  intensities  usually  exceed  the  number  of  parameters  needed  to 
describe  the  structure.  From  these  intensities  a  set  of  numbers  |£„|  can  be  derived,  one 
corresponding  to  each  intensity.  However  the  elucidation  of  the  crystal  structure 
requires  also  a  knowledge  of  the  complex  numbers  E^^  =  |  E^^\  exp  the  normalized 
structure  factors,  of  which  only  the  magnitudes  \Ef^\  can  be  determined  from 
experiment.  Thus  a  “phase”  unobtainable  from  the  diffraction  experiment,  must  be 
assigned  to  each  |£hI’  problem  of  determining  the  phases  when  only  the 

magnitudes  |£h|  are  known  is  called  “the  phase  problem”.  Owing  to  the  known 
atomicity  of  crystal  structures  and  the  redundancy  of  observed  magnitudes  \E^^\,  the 
phase  problem  is  solvable  in  principle. 

1.2  The  structure  invariants 

The  values  of  the  individual  phases  are  determined  by  the  crystal  structure  and  the 
choice  of  origin.  However,  there  always  exist  certain  linear  combinations  of  the  phases 
whose  values  are  determined  by  the  structure  alone  and  are  independent  of  the  choice  of 
origin.  These  linear  combinations  of  the  phases  are  called  the  structure  invariants. 


I 


1 


2 


Herbert  Hauptman 


1.3  The  structure  seminvariants 

For  all  space  groups  other  than  PI  the  origin  may  not  be  chosen  arbitrarily  if  one  is  to 
exploit  fully  the  space  group  symmetries.  Those  linear  combinations  of  the  phases 
whose  values  are  uniquely  determined  by  the  crystal  structure  and  are  independent  of 
the  choice  of  permissible  origin  are  known  as  the  structure  seminvariants.  Thus  the 
collection  of  structure  invariants  is  a  subset  of  the  collection  of  structure  seminvariants. 
In  the  space  group  PI  the  two  classes  coincide. 

1.4  The  fundamental  principle  of  direct  methods 

For  fixed  enantiomorph,  the  observed  magnitudes  |P|  determine,  in  general,  unique 
values  for  all  the  structure  seminvariants.  The  latter  in  turn,  as  certain  well-defined 
linear  combinations  of  the  phases,  lead  unambiguously  to  unique  values  for  the 
individual  phases.  Thus  the  structure  seminvariants  serve  to  link  the  known  magnitudes 
I E I  with  the  desired  phases  0  (the  fundamental  principle  of  direct  methods).  By  the  term 
“direct  methods”  is  meant  that  class  of  methods  which  exploits  relationships  among  the 
structure  factors  in  order  to  go  directly  from  the  observed  magnitudes  j  P  |  to  the  needed 
phases  0. 

1.5  The  neighbourhood  principle 

For  fixed  enantiomorph,  the  value  of  any  structure  seminvariant  T  is  primarily 
determined,  in  favourable  cases,  by  the  values  of  one  or  more  small  sets  of  observed 
magnitudes  |  E  |,  the  neighbourhoods  of  T,  and  is  relatively  insensitive  to  the  values  of 
the  great  bulk  of  remaining  magnitudes  (the  neighbourhood  principle).  The  conditional 
probability  distribution  of  P,  given  the  magnitudes  in  any  of  its  neighbourhoods,  yields 
an  estimate  for  T  that  is  particularly  good  in  the  favourable  case  that  the  variance  of  the 
distribution  happens  to  be  small. 

1.6  The  extension  concept 

By  embedding  the  structure  seminvariant  T  and  its  symmetry  related  variants  in 
suitable  structure  invariants  Q  one  obtains  the  extensions  Q  of  the  seminvariant  T. 
Owing  to  the  space  group-dependent  relationships  among  the  phases  the  value  of  P  is 
simply  related  to  the  values  of  its  extensions.  In  this  way  the  probabilistic  theory  of  the 
structure  seminvariants  is  reduced  to  that  of  the  structure  invariants,  which  is  well 
developed.  In  particular,  the  neighbourhoods  of  P  are  defined  in  terms  of  the 
neighbourhoods  of  its  extensions. 

1.7  Combining  direct  methods  with  isomorphous  replacement  and  anomalous  dispersion 

Most  “small”  crystal  structures,  i.e.  those  with  fewer  than  some  100  independent, 
nonhydrogen  atoms  per  unit  cell,  are  rather  routinely  solvable  nowadays  by  traditional 
direct  methods.  For  the  solution  of  macromolecular  structures,  on  the  other  hand,  the 
method  of  isomorphous  replacement  is  universally  used,  and  anomalous  dispersion 
often  plays  an  important  supplementary  role.  One  naturally  anticipates  therefore  that 
integrating  the  traditional  techniques  of  direct  methods  with  isomorphous  replacement 
and  anomalous  dispersion  will  strengthen  our  ability  to  solve  complex  structures.  This 
goal  has  recently  been  achieved,  and  the  initial  applications  strongly  suggest  that  the 
expected  improvement  is  in  fact  realized. 
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2.  The  phase  problem 

If  one  denotes  by  (f)^^  the  phase  of  the  structure  factor  F^^, 

^H=  l'^H|exp(i0H),  (1) 

then  the  relationship  between  the  F^^'s  and  the  electron  density  function  p(r)  is  given  by 


and 


p(r)exp  (27riHr)dl^ 


J 

y 


P('‘)  ^  77Z^Hexp(-2;riH  r) 


=  4Z[-^h|  expi((/)H-27rHT), 

'  u 


(2) 


(3) 


in  which  V  represents  the  unit  cell  or  its  volume  and  H  is  a  reciprocal  lattice  vector.  Only 
the  magnitudes  \F^^\  of  a  finite  number  of  structure  factors  are  obtainable  from 
experiment;  the  values  of  the  phases  which  are  also  needed  if  one  is  to  determine  p(r) 
from  (3),  cannot  be  determined  experimentally.  It  follows  that  the  phase  problem,  to 
determine  the  values  of  the  phases  (p^  of  the  structure  factors  F^^  when  only  the 
magnitudes  \F^^  \  are  given,  is,  in  principle,  unsolvable  when  formulated  in  these  terms. 

Next,  suppose  that  the  real  crystal,  with  continuous  electron  density  p(r),  is  replaced 
by  an  idealized  one,  the  unit  cell  of  which  consists  of  N  discrete,  non-vibrating,  point 
atoms.  Then  the  structure  factor  F^^  is  replaced  by  the  normalized  structure  factor  E^^ 
and  (1)  to  (3)  are  replaced  by 


b'H  —  I^hI  (^0h)’ 


(4) 


1 

L  X- exp (2mH •  Tj), 

j=i 


<  £„  exp  ( -  27riH  •  r)  >  H  - -fyj 


Z  fj  exp  [27riH  •  (r^  -  r)] 

j=i 


if  r  -  r^. 


(5) 


=  0  if  r  7^  Tj,  (6) 

where  f  is  the  zero-angle  atomic  scattering  factor,  is  the  position  vector  of  the  atom 
labelled  j,  and 

=  Z  /"^  n  =  1,  2,  3,  .  .  .  .  (7) 

j=  1 

In  the  x-ray  diffraction  case  the  f  are  equal  to  the  atomic  numbers  Zj  and  are  therefore 
all  positive;  in  the  neutron  diffraction  case  some  of  the  f  may  be  negative. 

In  practice  the  magnitudes  |  jEhI  f^e  normalized  structure  factors  E^are  obtainable 
(at  least  approximately)  from  the  observed  magnitudes  \F^^\  while  the  phases  0^,  as 
defined  by  (4)  and  (5),  cannot  be  determined  experimentally.  Since  one  now  requires  only 
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the  2>N  components  of  the  N  position  vectors  r^,  rather  than  the  much  more 
complicated  electron  density  function  p(r),  it  turns  out  that,  in  general,  the  known 
magnitudes  are  more  than  sufficient.  This  is  most  readily  seen  by  replacing  (5)  by 


N 


£h1  =  Z  fj  exp  (27riH  •  r^) 


(8) 


a  system  of  equations  in  which  the  only  unknowns  are  the  3N  components  of  the 
position  vectors  r^.  Since  the  number  of  equations  (8),  equal  to  the  number  of  reciprocal 
lattice  vectors  H  for  which  the  magnitudes  |£h|  are  observed,  usually  exceeds  the 
number  of  unknowns,  3iV,  by  far,  the  system  (8)  is  redundant.  Thus  the  phase  problem 
is,  in  principle,  solvable  when  reformulated  in  terms  of  fixed,  point  atoms. 

3.  The  structure  invariants 

Equation  (6)  implies  that  the  normalized  structure  factors  determine  the  crystal 
structure.  However  (5)  does  not  imply  that,  conversely,  the  crystal  structure  determines 
the  values  of  the  normalized  structure  factors  E^^  since  the  position  vectors  r^  depend 
not  only  on  the  structure  but  on  the  choice  of  origin  as  well.  It  turns  out  nevertheless 
that  the  magnitudes  \E^^\  of  the  normalized  structure  factors  are  in  fact  uniquely 
determined  by  the  crystal  structure  and  are  independent  of  the  choice  of  origin  but  that 
the  values  of  the  phases  depend  also  on  the  choice  of  origin.  Although  the  values  of 
the  individual  phases  depend  on  the  structure  and  the  choice  of  origin,  there  exist 
certain  linear  combinations  of  the  phases,  the  so-called  structure  invariants,  whose 
values  are  determined  by  the  structure  alone  and  are  independent  of  the  choice  of 
origin. 

If  the  origin  of  coordinates  is  shifted  to  a  new  point  having  position  vector  Tq  with 
respect  to  the  old  origin,  then,  from  the  definition  (equation  (5))  of  E^^,  it  follows 
readily  that  the  phase  the  normalized  structure  factor  E^^  with  respect  to  the  old 
origin  is  replaced  by  the  new  phase  (/>'„  with  respect  to  the  new  origin  given  by 


H  —  4*h  27rH‘ro. 


(9) 


Equation  (9)  implies  that  the  linear  combination  of  three  phases, 
•A3  =  <Ah+  ‘Ak'b 


(10) 


is  a  structure  invariant  (triplet)  provided  that 
H  +  K  +  L  =  0; 


(11) 


the  linear  combination  of  four  phases, 
'A4  =  0h+<Ak+<Al+‘Am 


(12) 


is  a  structure  invariant  (quartet)  provided  that 
H  -|-  K  L  +  IVI  =  0; 


(13) 


etc. 
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4.  The  structure  seminvariants 

For  all  space  groups  other  than  PI  the  origin  may  not  be  chosen  arbitrarily  if  the 
simplification  permitted  by  the  space  group  symmetries  is  to  be  realized.  For  example,  if 
a  crystal  has  a  centre  of  symmetry  it  is  natural  to  place  the  origin  at  such  a  centre  while  if 
a  two-fold  screw  axis,  but  no  other  symmetry  element  is  present,  the  origin  would 
normally  be  situated  on  this  symmetry  axis.  In  such  cases  the  permissible  origins  are 
greatly  restricted  and  it  is  therefore  plausible  to  assume  that  many  linear  combinations 
of  the  phases  will  remain  unchanged  in  value  when  the  origin  is  shifted  only  in  the 
restricted  ways  allowed  by  the  space  group  symmetries.  One  is  thus  led  to  the  notion  of 
the  structure  seminvariant,  those  linear  combinations  of  the  phases  whose  values  are 
independent  of  the  choice  of  permissible  origin. 

4.1  The  equivalence  concept 

For  any  space  group  the  coordinates  of  equivalent  positions  depend  upon  the  choice  of 
origin.  Hence  the  functional  form  for  the  geometric  structure  factor  also  depends  on  the 
choice  of  origin.  Two  origins  will  be  said  to  be  equivalent  if  they  give  rise  to  the  same 
functional  form  for  the  geometric  structure  factor.  Alternatively,  two  points  are 
equivalent  if  they  are  geometrically  situated  in  the  same  way  with  respect  to  the 
symmetry  elements.  Thus,  in  the  space  group  PI  all  points  are  equivalent;  in  P I  all  eight 
centres  of  symmetry  are  equivalent  to  each  other,  but  no  other  point  is  equivalent  to  any 
of  these  eight;  in  P2i  all  points  on  any  of  the  four  two-fold  screw  axes  are  equivalent  to 
each  other,  bdt  no  other  point  is  equivalent  to  a  point  on  a  two-fold  screw  axis;  in  P4  all 
points  on  either  of  the  two  four-fold  axes  are  equivalent  to  each  other,  all  points  on 
either  of  the  two  two-fold  axes  are  equivalent  to  each  other,  but  no  point  on  a  two-fold 
axis  is  equivalent  to  any  point  on  a  four-fold  axis.  All  points  equivalent  to  a  given  point 
are  equivalent  to  each  other  and  are  said  to  form  an  equivalence  class. 

4.2  The  primary  origin 

Refer  to  “International  Tables  of  X-Ray  Crystallography”,  Vol.  1,  in  order  to  define  the 
primary  origin  for  each  space  group. 

4.3  The  permissible  origins 

All  points  equivalent  to  the  primary  origin  constitute  the  permissible  origins  for  each 
space  group. 

4.4  The  structure  seminvariants 

The  structure  seminvariants  are  those  linear  combination  of  the  phases  whose  values 
are  uniquely  determined  by  the  crystal  structure  alone,  no  matter  what  the  choice  of 
permissible  origin.  Alternatively,  for  a  given  functional  form  for  the  geometric  structure 
factor,  the  values  of  the  structure  seminvariants  are  determined  by  the  structure  alone. 

The  structure  invariants  and  seminvariants  have  been  tabulated  for  all  the  space 
groups  (Hauptman  and  Karle  1953,  1956,  1959;  Karle  and  Hauptman  1961;  Lessinger 
and  Wondratschek  1975).  A  few  examples  will  illustrate  the  method  used  to  identify  the 
structure  seminvariants  in  the  different  space  groups. 
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4.5  Space  Goup  PI 

Refer  to  “International  Tables  of  X-Ray  Crystallography”,  Vol.  1.  to  infer  that  there  are 
eight  permissible  origins  in  PI: 

(0  0  0),  (i  0  0),  (0  i  0),  (0  0  i),  (i  i  0),  (i  0  i),  (0  i  i),  (i  i  i).  (14) 

4.5a.  The  one-phase  structure  seminvariants  From  (9)  it  follows  that  </> '„  =  0h 
all  permissible  origins  Tq  if  and  only  if  H  •  Tq  is  an  integer,  i.e.  in  view  of  (14),  if  and  only  if 
the  components  of  H  are  even.  Hence  the  single  phase  (/»„  is  a  structure  seminvariant  in 
PI  if  and  only  if  the  three  components  of  H  are  even  integers. 

4.5b.  The  two-phase  structure  seminvariants  (f)  y^-\-  (f)^.  Follow  the  argument  in  §4.5.ato 
infer  that  the  linear  combination  of  two  phases  +  ^  structure 

seminvariant  if  and  only  if  the  three  components  of  H  +  K  are  even  integers. 

4.6  Space  groups  PI  and  Pl^;  Unique  axis  b. 

The  permissible  origins  are 

(OyO),  (Oyi),  (iyO),  (i  y  i),  (15) 

where  y  is  arbitrary.  Then  from  (9)  infer  that  the  single  phase  is  a  structure 
seminvariant  if  and  only  if  h  and  /  are  both  even  and  k  —  0;  the  linear  combination  of 
two  phases 

is  a  structure  seminvariant  if  and  only  if  /11+I12  and  I1  +  I2  are  both  even  and 
fcj  -t-  /c2  =  0;  etc. 

4.7  Space  group  P4 

The  permissible  origins  are 

(OOz),  (Hz),  (16) 

where  z  is  arbitrary.  It  follows  from  (9)  that  the  single  phase  is  a  structure 
seminvariant  if  and  only  if  -t-  /c  is  even  and  /  =  0;  the  linear  combination  of  two  phases 

is  a  structure  seminvariant  if  and  only  if  (/ij  -f  k^)  +  (/ij  +  /C2)  is  even  and  1^+12  =  0;  etc. 

4.8  Origin  and  enantiomorph  specification 

The  theory  of  the  structure  seminvariants  leads  in  a  natural  way  to  space  group 
dependent  recipes  for  origin  and  enantiomorph  specification. 

In  general  the  theory  identifies  an  appropriate  set  of  phases  whose  values  are  to  be 
specified  in  order  to  fix  the  origin  uniquely.  For  example,  in  space  group  PI  the  values 
of  any  three  phases 


(17) 
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for  which  the  determinant  A  satisfies 


A  = 


/ij  /ci  /i 
^2  ^2  ^2 


±  1, 


(18) 


may  be  specified  arbitrarily,  thus  fixing  the  origin  uniquely.  Once  this  is  done  then  the 
value  of  any  other  phase  is  uniquely  determined  by  the  structure  alone.  For 
enantiomorph  specification  it  is  sufficient  to  specify  arbitrarily  the  sign  of  any 
enantiomorph  sensitive  structure  invariant,  x.e.  one  whose  value  is  different  from  0  or  n. 
(See  Hauptman  1972,  pages  28-52,  for  further  details.) 

In  the  space  group  PI  one  again  specifies  arbitrarily  the  value  (0  or  n)  of  three  phases 
(17),  but  now  the  condition  is  that  the  determinant  A  [defined  by  (18)]  be  odd. 

Similar  recipes  for  all  the  space  groups  are  now  known  and  are  to  be  found  in  the 
literature  cited. 


5.  The  fundamental  principle  of  direct  methods 

It  is  known  that  the  values  of  a  sufficiently  extensive  set  of  cosine  seminvariants  (the 
cosines  of  the  structure  seminvariants)  lead  unambiguously  to  the  values  of  the 
individual  phases  (Hauptman  1972).  Magnitudes  |£|  are  capable  of  yielding  estimates 
of  the  cosine  seminvariants  only  or,  equivalently,  the  magnitudes  of  the  structure 
seminvariants;  the  signs  of  the  structure  seminvariants  are  ambiguous  because  the  two 
enantiomorphous  structures  permitted  by  the  observed  magnitudes  |  E  |  correspond  to 
two  values  of  each  structure  seminvariant  differing  only  in  sign.  However,  once  the 
enantiomorph  has  been  selected  by  specifying  arbitrarily  the  sign  of  a  particular 
enantiomorph  sensitive  structure  seminvariant  {i.e.  one  different  from  0  or  n),  then  the 
magnitudes  |  E  \  determine  both  signs  and  magnitudes  of  the  structure  seminvariants 
consistent  with  the  chosen  enantiomorph.  Thus,  for  fixed  enantiomorph,  the  observed 
magnitudes  |  E  |  determine  unique  values  for  the  structure  seminvariants;  the  latter,  in 
turn,  lead  to  unique  values  of  the  individual  phases.  In  short,  the  structure 
seminvariants  serve  to  link  the  observed  magnitudes  |  E  \  with  the  desired  phases  f  (the 
fundamental  principle  of  direct  methods).  It  is  this  property  of  the  structure 
seminvariants  which  accounts  for  their  importance  and  which  justifies  the  stress  placed 
on  them  here. 


6.  The  Neighbourhood  principle:  Estimating  the  structure  invariants 

It  has  long  been  known  that,  for  fixed  enantiomorph,  the  value  of  any  structure 
seminvariant  ij/  is,  in  general,  uniquely  determined  by  the  magnitudes  |£|  of  the 
normalized  structure  factors.  Recently  it  has  become  clear  that,  for  fixed  enantiomorph, 
there  corresponds  to  ip  one  or  more  small  sets  of  magnitudes  |  E  |,  the  neighbourhoods 
of  ip,  on  which,  in  favourable  cases,  the  value  of  ip  most  sensitively  depends;  that  is  to  say 
that,  in  favourable  cases,  ip  is  primarily  determined  by  the  values  of  |  £  |  in  any  of  its 
neighbourhoods  and  is  relatively  independent  of  the  values  of  the  great  bulk  of 
remaining  magnitudes.  The  conditional  probability  distribution  of  \p,  assuming  as 
known  the  magnitudes  |  £  |  in  any  of  its  neighbourhoods,  yields  an  estimate  for  ip  which 
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is  particularly  good  in  the  favourable  case  that  the  variance  of  the  distribution  happens 
to  be  small  [the  neighbourhood  principle  (Hauptman,  1975a,  b)]. 


6.1  The  three-phase  structure  invariant  {triplet)  in  PI 

6.1.a.  The  first  neighbourhood  of  the  three-phase  structure  invariant.  If  H,  K,  and  L  are 
three  reciprocal  lattice  vectors  satisfying  (11),  then  the  first  neighbourhood  of  the  three- 
phase  structure  invariant  ij/^  [equation  (10)]  is  defined  to  consist  of  the  three 
magnitudes 

|£h|.  |£k|.  |£lI-  (19) 

6.1b.  The  probabilistic  background.  Suppose  that  a  crystal  structure  consisting  of  N 
atoms  (not  necessarily  identical)  per  unit  cell  in  PI  is  fixed.  Denote  reciprocal  space  by 
IF and  by  IFx  IFx  IFthe  three-fold  Cartesian  product  consisting  of  all  ordered  triples 
(H,  K,  L)  of  reciprocal  lattice  vectors.  Assume  also  that  Pj,  P2»3Bd  P,  are  fixed  positive 
numbers.  Suppose  finally  that  the  primitive  random  variable  (vector)  is  the  ordered 
triple  (H,  K,  L)  of  reciprocal  lattice  vectors  which  is  assumed  to  be  uniformly 
distributed  over  the  subset  o^Wy.W^KW  defined  by 

|£hI  =  R..  |£lI  =  R3.  (20) 

and  (11).  Then  the  three-phase  structure  invariant  i/!>3  [equation  (10)],  as  a  function  of 
the  primitive  random  variable  (H,  K,  L),  is  itself  a  random  variable. 

6.1c.  The  condition  probability  distribution  of  the  three-phase  structure  invariant  ij/^, 
assuming  as  known  the  three  magnitudes  in  its  first  neighbourhood.  Denote  by 

P,|3=P('F|Pi,P2,P3)  (21) 

the  conditional  probability  distribution  of  1/^3,  given  (20).  Then  (Cochran  1955; 
Hauptman  1976b) 


P 


1|3 


1 

2nloiA) 


exp  {A  cos'F), 


(22) 


where  Iq  is  the  modified  Bessel  function. 


A  = 


2<t^ 


Pi  P2P3, 


(23) 


and  cr„  is  defined  by  (7).  Since  A  >  0,P ,  |3  has  a  unique  maximum  at  'P  =  0,  and  it  is  clear 
that  the  larger  the  value  of  A  the  smaller  is  the  variance  of  the  distribution.  See  figure  1, 
where  A  =  2-316,  figure  2,  where  A  =  0-731.  Hence  in  the  favourable  case  that  A  is 
large,  say,  for  example,  A  >  3,  the  distribution  leads  to  a  reliable  estimate  of  the 
structure  invariant  1/^3,  zero  in  this  case: 

i/^3  «  0  if  T  is  large.  (24) 


Furthermore,  the  larger  the  value  of  A,  the  more  likely  is  the  probabilistic  statement 
(24).  It  is  remarkable  how  useful  this  relationship  has  proved  to  be  in  the  applications; 
and  yet  (24)  is  severely  limited  because  it  is  capable  of  yielding  only  the  zero  estimate  for 
i/^3,  and  only  those  estimates  are  reliable  for  which  A  is  large,  the  favourable  cases. 
There  is  nothing  in  the  older  formulations  of  this  relationship  which  suggests,  for 
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Figure  1.  The  distribution  equation  (22)  for  A  =  2-316. 


p 


Figure  2.  The  distribution  Pi,3,  equation  (22)  for  A  —  0-731. 


example,  how  one  may  obtain  a  reliable  estimate  for  if/^in  the  neighbourhood  of  n.  A 
major  goal  in  the  present  formulation  is  to  make  clear  how  the  results  described  here 
are  to  be  extended  to  structure  invariants  in  general  and  to  yield  better  estimates  which 
may  lie  anywhere  in  the  interval  ( —  n,  n). 

It  should  perhaps  be  mentioned  in  passing  that  another  consequence  of  the 
distribution  (22)  is  the  so-called  tangent  formula  which  is  universally  used  by  direct 
methods  practitioners: 

’’  <  l^k^h-kl cos (</),, -f(/)h_k)>k’ 

in  which  h  is  a  fixed  reciprocal  lattice  vector,  the  averages  are  taken  over  the  same  set 
of  vectors  k  in  reciprocal  space,  usually  restricted  to  those  vectors  k  for  which  \E^^\  and 
are  both  large,  and  the  sign  of  sin 0,, (cos (/>,,)  is  the  same  as  the  sign  of  the 
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numerator  (denominator)  on  the  right  hand  side.  The  tangent  formula  is  usually  used  to 
refine  and  extend  a  basis  set  of  phases,  presumed  to  be  known. 

6.2  The  four-phase  structure  invariant  (quartet)  in  PL 

6.2a.  The  first  neighbourhood  of  the  four-phase  structure  invariant.  If  H,  K,  L,  and  M 
are  four  reciprocal  lattice  vectors  satisfying  (13),  then,  in  analogy  with  §6.  la,  the  first 
neighbourhood  of  the  four-phase  structure  invariant  1/^4  [equation  (12)]  is  defined  to 
consist  of  the  four  magnitudes 

I^hI’  I-^kI’  (2^) 

6.2b.  The  probabilistic  background.  One  employs  the  same  probabilistic  background 
as  in  §6.1  b  with  the  difference  that  now  one  specifies  four  positive  numbers  R^,  R2,  R3, 
and  /?4  and  assumes  that  the  primitive  random  variable  is  the  ordered  quadruple 
(H,  K,  L,  M)  of  reciprocal  lattice  vectors  which  is  uniformly  distributed  over  the  subset 
of  the  four-fold  Cartesian  product  Wx  Wx  Wx  IT  defined  by 

I-^hI  ~  -^1’  I-^mI  ~  -^45  (26) 

and  (13).  Then  the  four-phase  structure  invariant  1/^4  [equation  (12)],  as  a  function  of 
the  primitive  random  variable  (H,  K,  L,  M),  is  itself  a  random  variable. 

6.2c.  The  conditional  probability  distribution  of  the  four-phase  structure  invariant  1/^4, 
assuming  as  known  the  four  magnitudes  in  its  first  neighbourhood.  Denote  by 

P,|4=P('F|i?i,/?2,i?3,i?4),  (27) 

the  conditional  probability  distribution  of  1//4,  given  (26).  Then  (Hauptman,  1975a,  b), 

^  1 14 ~  (^ cos (28) 

Znl o[d) 

where 

B  =  (29) 

and  a„  is  defined  by  (7).  Thus  Tjuis  identical  with  Fj^,  but  B  replaces  A.  Hence  similar 
remarks  apply  to  F ,  14.  In  particular,  (28)  always  has  a  unique  maximum  at  T  =  0  so  that 
the  most  probable  value  of  1//4,  given  the  four  magnitudes  (26)  in  its  first  neighbour¬ 
hood,  is  zero,  and  the  larger  the  value  of  B  the  more  likely  it  is  that  1/^4  0.  Since  B 

values,  of  order  1/N,  tend  to  be  less  than  A  values,  of  order  l/yjN,  at  least  for  large 
values  of  N,  the  estimate  (zero)  of  1/^4  is  in  general  less  reliable  than  the  estimate  (zero)  of 
1/^3.  Hence  the  goal  of  obtaining  a  reliable  non-zero  estimate  for  a  structure  invariant  is 
not  realized  by  (28).  The  decisive  step  in  this  direction  is  made  next. 

6.2d.  The  second  neighbourhood  of  the  four-phase  structure  invariant  If  one  as¬ 
sumes  as  known  not  only  the  four  magnitudes  (25),  but  the  additional  three  magnitudes 
I^h+k|’  1^k+l|>  iri  favourable  cases,  one  obtains  a  more  reliable  estimate 

for  1//4  and,  furthermore,  the  estimate  may  lie  anywhere  in  the  interval  0  to  n.  The  idea  to 
use  these  three  additional  magnitudes  is  suggested  by  the  following  heuristic  argument 
(Schenk  and  de  Jong  1973;  Schenk  1973,  1974;  Hauptman  1976a). 
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Assume  first  that  the  four  magnitudes  (25)  are  all  large: 

I^h|»  I-EkI-  I-ElI’  I are  all  large,  (30) 

where  H,  K,  L,  M  satisfy  (13).  If  it  should  happen  that  |£h+kI  large  then  (24) 

implies 

^ -H-K  ~  (31) 

and,  in  view  of  (13), 

</>L+</>M+</»H  +  K~  0-  (32) 

Then,  by  the  addition  of  (31)  and  (32), 

(33) 

If  |£k+lI  Is  large  or  if  I^l+hI  Is  large  then,  by  symmetry,  the  relationship  (33)  again 
holds.  If  it  should  happen  that 

I-^h+kI’  |^k+l|’  I^l+hI  ^11  large,  (34) 


then  one  would  again  expect  (33)  to  hold,  but  now  with  very  high  probability.  However, 
the  distribution  (28)  implies  that  the  value  of  1/^4  must  occasionally  be  in  the 
neighbourhood  of  n,  even  if  B  is  quite  large.  How  is  one  to  identify  the  small  fraction  of 
quartets  which  are  in  fact  near  tt?  In  view  of  the  foregoing  argument  it  is  plausible  to 
suppose  that 

~  ^  (35) 

precisely  in  the  case  that 

I^h+kI’  I-Ek+lI-  1El+m|  are  all  small.  (36) 

Although  the  argument  presented  here  is  heuristic  only  and  proves  nothing,  it  does 
suggest  the  question:  What  is  the  conditional  probability  distribution  of  the  quartet  1/^4, 
assuming  as  known  the  seven  magnitudes  |£h1’  I^kI’  |El1»  |E|v,|,  |Eh+k|’  |Ek+l|’  IEl+hI*^ 
In  view  of  this  discussion  one  defines  the  second  neighbourhood  of  1/^4  to  consist  of 
the  seven  magnitudes 

|Eh|’  IEkI)  |El|,  I-EmI’  [Eh+kI’  |Ek+lI>  |El+h1-  (32) 

6.2e.  The  conditional  probability  distribution  of  the  four-phase  structure  invariant  1/^4, 
assuming  as  known  the  seven  magnitudes  in  its  second  neighbourhood.  Assume  that  the 
seven  positive  numbers  Ej,  R2,  R3,  R4,  E12,  E23,  E31  are  fixed.  Suppose  next  that  the 
ordered  quadruple  of  reciprocal  lattice  vectors  (H,  K,  L,  M)  is  a  random  variable  which 
is  uniformly  distributed  over  the  subset  of  the  fourfold  Cartesian  product  WxWxW 
X  W  defined  by 

|Eh|  ~  Ej,  |E,j|  =  R2,  |El|  =  E3,  |E,y,|  =  E4;  (38) 

|Eh  +  k|  ~  Ei25  IEk+lI  =  E23,  |El+h1  ~  E31;  (39) 

and  (13).  Then  1/^4 ,  as  a  function  of  the  primitive  random  variable  (H,  K,  L,  M),  is  itself  a 
random  variable.  Denote  by 

Ei|7  ~  E('F|Ej,  R2,  R3,  R4,  Ej2?  E23,  R31) 


(40) 
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the  conditional  probability  distribution  of  the  quartet  1//4,  given  the  seven  magnitudes 
(38)  and  (39).  Then  (Hauptman,  1975a,  b;  1976b) 


where 


y  exp(  — 2B'  cosT')/ol 


I  X  ' 


B'  —  (3(73  —  (^2(7^)  R1R2R3R4, 

U'2 

X,2  =  lRiRl  +  RIRI  +  2R1R2R3R4  cos'F]'/^ 
X2i  =  [RlRl  +  RiRl  +  2RiR2R3R^cos^'y/\ 
X31  =  [RlRl  +  RlRl  +  2R,R2R3R^cos^>y>\ 


(41) 

(42) 

(43) 

(44) 

(45) 


a„  is  defined  by  (7),  and  L  is  a  normalizing  parameter,  independent  of  'F,  which  is  not 
needed  for  the  present  purpose. 

Figures  3-5  show  the  distribution  (41)  (solid  line - )  for  typical  values  of  the  seven 

parameters  (38)  and  (39).  For  comparison  the  distribution  (28)  (broken  line - )  is  also 

shown.  Since  the  magnitudes  |  E  |  have  been  obtained  from  a  real  structure,  comparison 
with  the  true  value  of  the  quartet  is  also  possible.  As  already  emphasized,  the 
distribution  (28)  always  has  a  unique  maximum  at  'F  =  0.  The  distribution  (41),  on  the 
other  hand,  may  have  a  maximum  at  'F  =  0,  or  n,  or  any  value  between  these  extremes, 
as  shown  by  figures  3-5.  Roughly  speaking,  the  maximum  of  (41)  occurs  at  0  or  ;i 
according  as  the  three  parameters  R12,  i?23)  ^31  are  all  large  or  all  small,  respectively, 
thus  confirming  the  heuristic  argument  given  earlier.  These  figures  also  clearly  show  the 
improvement  which  may  result  when,  in  addition  to  the  four  magnitudes  (38),  the  three 


p 


Figure  3.  The  distribution  (41)  ( - )  and  (28)( - )  for  the  values  of  the  seven 

parameters  (38)  and  (39)  shown.  The  mode  of  (41)  is  0,  of  (28)  always  0. 
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Figure  4.  The  distribution  (41)  ( - )  and  (28)  ( - )  for  the  values  of  the  seven 

parameters  (38)  and  (39)  shown.  The  mode  of  (41)  is  105°,  of  (28)  always  0. 


p 


Figure  5.  The  distribution  (41)  (- - )  and  (28)  ( - )  for  the  values  of  the  seven 

parameters  (38)  and  (39)  shown.  The  mode  of  (41)  is  180°,  of  (28)  always  0. 


magnitudes  (39)  are  also  assumed  to  be  known.  Finally,  in  the  special  case  that 

■^12  ^  ■^23  ^  ^  ^ 

the  distribution  (41)  reduces  to 

y  exp  (-25'  cos*F), 

which  has  a  unique  maximum  at  T  =  tt  (fig.  5). 


(46) 

(47) 


6.3  Space  group  PI 

In  complete  analogy  with  51,  conditional  probability  distributions  for  the  triplet  (^3  in 
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PI,  given  three  magnitudes,  and  for  the  quartet  1/^4  in  PI,  given  four  or  seven 
magnitudes,  are  readily  derived.  The  major  difference  is  that  now  the  distributions  to  be 
described  are  discrete,  because  the  invariants  take  on  only  the  two  values  0,  n.  Using  the 
definitions  and  terminology  of  §§6.1  and  6.2,  the  present  account  is  greatly  abbreviated. 


6.3a.  The  conditional  probability  distribution  of  the  triplet  1/^3  in  PI,  assuming  as  known 
the  three  magnitudes  in  its  first  neighbourhood.  Make  the  same  hypotheses  as  in  §6.1. 
Denote  by  P^iP^)  the  conditional  probability  that  1/^3  =  0(7r),  given  the  three 
magnitudes  (20).  Then  (Woolfson  1954;  Hauptman  1976b) 

P^  =  ^Zi,  (48) 

Z3±  =  exp(±^/2),  (49) 

and 

X3  =  2  cosh  {All),  (50) 

where  A  is  given  by  (23)  and  upper  (lower)  signs  in  (48)  and  (49)  go  together.  Equations 
(48)  to  (50)  should  be  compared  with  the  PI  analogue,  (22).  It  follows  that 

Pt>h  (51) 

so  that  the  estimate  1/^3  =  n,  given  only  the  three  magnitudes  (20),  is  impossible. 

6.3b.  The  conditional  probability  distribution  of  the  quartet  1/^4  in  PI,  assuming  as  known 
the  four  magnitudes  in  its  first  neighbourhood.  Denote  by 

P;orP4,  (52) 


the  conditional  probability  that  the  quartet  1/^4  be  equal  to  0  or  rr  respectively,  given  the 
four  magnitudes  (26).  Then  (Green  and  Hauptman  1976;  Hauptman  et  al  1976; 
Hauptman  1976b) 


where 

Zf  =  exp  (  +  P/2), 

=  Z4  +  Z4  , 


(53) 

(54) 

(55) 


and  P  is  defined  by  (29).  The  distribution  (53)-(55)  should  be  compared  with  the  PI 
analogue  (28).  The  variance  of  the  distribution  (53)  is  given  by 


Var  =  4P7P, 


Clearly 


P  ^ 


>i 


{4z:zf)/iz: 


+  Zf)^ 


(56) 

(57) 


so  that  (53)  can  give  only  the  zero  estimate  for  the  quartet.  Finally,  the  smaller  the  value 
of  the  variance,  the  more  reliable  is  the  estimate,  zero  in  this  case. 


6.3.C.  The  conditional  probability  distribution  of  the  quartet  1/^4  in  PI,  assuming  as  known 
the  seven  magnitudes  in  its  second  neighbourhood.  Denote  by 

P|  or  P7 


(58) 
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the  conditional  probability  that  the  quartet  1/^4  be  equal  to  0  or  tt  respectively,  given  the 
seven  magnitudes  (38)  and  (39).  Then  (Green  and  Hauptman  1976;  Hauptman  and 
Green  1976;  Hauptman  1976b;  but  see  also  Giacovazzo,  1976a  for  a  different  but 
related  result) 


p  ±  —  ^  7  ± 

where 

=  exp(+B') cosh|^^i?i2^±2^cosh|^^/?23A^2~3^  ^ 

^^3/2  ^31  ^31^’ 

X  1*2  —  +  R3R4., 

^  23  —  R2R3  i  1^45 

X  *1  =  i?3  J?1  +  R2R4, 

Kj  —  Z  2  Z  2  , 
and  the  variance  is  given  by 


(59) 


(60) 

(61) 

(62) 

(63) 

(64) 


Var  =  4F^P7 


4Z2+Z7 

{z;+z^f 


(65) 


In  contrast  to  (53),  may  lie  anywhere  in  the  interval  0  to  1  so  that  (59)-(64)  may  give 
either  the  0  or  tt  estimate  for  the  quartet.  In  complete  analogy  with  PI,  the  estimate  is  0 
or  Tt  according  sls  R12,  ^23^  R31  are  all  large  or  all  small  respectively,  the  favourable 
cases.  In  the  extreme  case  that 


P12  ^  R23  ^  R31  ^  0,  (b6) 

(60)  reduces  to 

Zj^  =  exp(+B').  (67) 

Again,  the  smaller  the  value  of  the  variance,  the  more  reliable  is  the  estimate,  0  or  tt. 


6.4  The  discriminant  of  the  five-phase  structure  invariant  (quintet) 

6.4a.  The  first  two  neighbourhoods.  The  linear  combination  of  five  phases 

•As  =  <^H'h  ‘Al+ <^M+ (68) 
is  a  structure  invariant  (quintet)  if 

H  +  K  +  L  +  M  +  N  =  0.  (69) 

In  complete  analogy  with  the  earlier  work  on  quartets,  the  first  neighbourhood  of  1// 5  is 
defined  to  consist  of  the  five  magnitudes 

I^hI  ~  Ri’  I-^kI  ~  R2>  I-^lI  ~  R3’  I^mI  ~  R^y  I-EnI  ~  Rs^  C70) 

and  the  second  neighbourhood  of  the  five  magnitudes  (70),  the  “main  terms”,  plus  the 
additional  ten  magnitudes,  the  “cross-terms”. 
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I^h  +  k| 

—  ^12) 

I^^h+lI 

==  ^13» 

1 F  1 

1  ^h+m| 

Rl4f  I-^H  +  nI  ~  7?i5, 

I^K  +  lI 

—  ^23) 

I^k+mI 

=  R245 

|£k,nI 

—  ^25> 

I^l+mI 

—  ^34) 

1£l4nI 

=  ^35> 

I-^M  +  nI 

^45> 

(71) 

i.e.  15  magnitudes  |£|  in  all. 

6.4b.  The  heuristic  argument.  By  means  of  plausible  reasoning,  in  analogy  with  §6.2d., 
one  is  led  to  identify  those  quintets  having  the  extreme  values  0  or  ti  as  summarized  in 
table  1,  the  entries  of  which  give  the  values,  large  (L)  or  small  (S)  of  the  15  magnitudes, 
(70)  and  (71),  in  the  second  neighbourhood  of  1/^5  corresponding  to  1//5  =  0  or  7t. 

6.4c.  The  probabilistic  theory  of  quintets  in  PI  and  PI  (Schenk  1975;  Hauptman  1977; 
Putten  and  Schenk  1977;  Fortier  and  Hauptman  1977a,  b;  Hauptman  and  Fortier 
1977).  Using  the  probabilistic  background  described  earlier  and  referring  to  the 
literature  one  is  led  to  define  the  discriminant  A  of  the  quintet  1/^5  as  follows: 


A  =  Rj 


2(7  \ 


„9I1 
yi  15 


iRliR 


34 


la-. 


(3(Ti-(T2  0'4)Z^ 


2 

12 


10 


r?/2 


{\Sal  -  10020^0 4,  yala^) 


where 


(72) 


TT  D  2  n2  _  p2  p2  ,  Q  2  p2  _i_p2  p2  ip2  p2 

2^^12^34  —  ^12^34  +  ^12^35''"^12^45+^13^24 
15 

+  ^13^25+^13^45  +  ^14^23  +  ^14^25 
+  ^14^35  +  ^15^23  +  ^15^24  +  ^15^34 

+  RhRls  +  Rl^Rl5  +  RIsRIa,  (73) 


Table  1.  The  probable  values  of  the  structure  invariant  +  +  + 

given  the  15  magnitudes  in  its  second  neighbourhood;  L  means  large,  S  means  small. 


•As 

R. 

R2 

R3 

R4 

R5 

Ri2 

Ri3 

R,4 

R.5 

R23 

R24 

R25 

R34 

R35 

R45 

0 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

n 

L 

L 

L 

L 

L 

L 

L 

L 

L 

S 

S 

S 

S 

S 

S 

n 

L 

L 

L 

L 

L 

L 

S 

S 

S 

L 

L 

L 

s 

S 

S 

It 

L 

L 

L 

L 

L 

S 

L 

s 

s 

L 

S 

S 

L 

L 

s 

n 

L 

L 

L 

L 

L 

S 

S 

L 

s 

S 

L 

s 

L 

S 

L 

n 

L 

L 

L 

L 

L 

S 

s 

S 

L 

s 

S 

L 

S 

L 

L 

71 

L 

L 

L 

L 

L 

L 

L 

S 

S 

L 

S 

S 

S 

S 

S 

7t 

L 

L 

L 

L 

L 

L 

S 

L 

S 

S 

L 

s 

S 

s 

s 

71 

L 

L 

L 

L 

L 

L 

S 

S 

L 

S 

S 

L 

S 

s 

s 

K 

L 

L 

L 

L 

L 

S 

L 

L 

S 

S 

s 

S 

L 

s 

s 

K 

L 

L 

L 

L 

L 

S 

L 

S 

L 

s 

s 

s 

S 

L 

s 

71 

L 

L 

L 

L 

L 

S 

S 

L 

L 

s 

s 

s 

S 

S 

L 

71 

L 

L 

L 

L 

L 

S 

S 

S 

S 

L 

L 

s 

L 

S 

S 

71 

L 

L 

L 

L 

L 

S 

s 

s 

s 

L 

s 

L 

S 

L 

s 

71 

L 

L 

L 

L 

L 

S 

s 

s 

s 

S 

L 

L 

s 

S 

L 

71 

L 

L 

L 

L 

L 

S 

s 

s 

s 

S 

S 

s 

L 

L 

L 
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(74) 


As  described  in  the  literature  the  discriminant  A  is  readily  computable  and  strongly 
correlated  with  extreme  values  of  the  quintet  in  FI  and  PI  in  the  sense  that  1/^5  «  0  or 
1//5  «  7t  according  as  A  >  0  or  A  0,  respectively,  all  in  accord  with  table  1. 

6.5  The  neighbourhood  principle 

It  has  been  known  for  many  years  that,  for  a  chosen  enantiomorph,  the  values  of  the 
structure  factor  magnitudes  |  E  \  uniquely  determine,  in  general,  {i.e.  provided  that  no 
homometric  solutions,  other  than  enantiomorphs,  exist)  the  values  of  the  structure 
invariants.  Alternatively,  for  a  fixed  functional  form  for  the  geometric  structure  factor 
and  for  fixed  choice  of  enantiomorph,  the  magnitudes  |£|  uniquely  determine  the 
values  of  the  structure  seminvariants.  A  number  of  formulas  explicitly  relating  the 
cosine  invariants  to  the  magnitudes  of  all  (or  large  numbers  of)  structure  factors  E  have 
in  fact  been  known  for  some  time  and  these  have  played  an  important  role  in  the 
applications.  However  their  value  has  been  somewhat  limited  by  the  requirement  that 
certain  kinds  of  rational  dependence  among  atomic  coordinates  not  be  present,  and  for 
very  complex  structures  their  usefulness  is  greatly  reduced.  The  results  described  in  the 
preceding  paragraphs  however  point  strongly  to  a  deeper  insight.  Instead  of  seeking  to 
express  the  structure  seminvariants  in  terms  of  all  observed  structure  factor  magnitudes 
|£|,  or  even  in  terms  of  large  numbers  of  |£|’s,  as  the  earlier  formulas  did,  it  is  now 
suggested  that  the  value  of  each  structure  seminvariant  if/  is  primarily  determined,  in 
favourable  cases,  by  the  values  of  one  or  more  small  sets  of  appropriately  chosen 
magnitudes  |£|,  the  so-called  neighbourhoods  of  xj/,  and  is  relatively  insensitive  to  the 
vast  bulk  of  the  remaining  magnitudes  (the  neighbourhood  principle,  Hauptman 
1975a,  b). 

Thus  in  the  favourable  case  that  A,  (23),  is  large  the  value  of  the  three-phase  structure 
invariant  xpj  is  primarily  determined  by  the  three  magnitudes  (20)  and  the  larger  the 
value  of  A  the  more  reliable  is  the  estimate  for  1/^3,  zero  in  this  case,  as  reference  to  (22) 
shows.  Hence  the  first  neighbourhood  of  the  three-phase  structure  invariant  1//3  is 
defined  to  consist  of  the  three  magnitudes  (20). 

Next,  if  one  is  given  only  the  four  magnitudes  (26),  then  the  most  probable  value  of 
the  quartet  xf/^  is  zero  in  view  of  (28),  and  in  the  favourable  case  that  B,  (29),  is  large  the 
zero  estimate  is  good.  Thus  the  first  neighbourhood  of  the  quartet  xj/4,  is  defined  to 
consist  of  the  four  magnitudes  (27). 

Suppose  next  that  the  seven  magnitudes  (38)  and  (39)  are  given.  In  this  case  the  most 
probable  value  of  the  quartet  1//4  is  given  by  the  maximum  of  the  distribution  (41)  and 
this  estimate  may  lie  anywhere  in  the  range  (0,  n)  [or,  because  of  symmetry,  in  the  range 
(  —  71,0)],  depending  on  the  values  of  the  seven  parameters  (38)  and  (39).  In  the 
favourable  case  that  the  variance  of  the  distribution  (41)  is  very  small  then  the  estimate 
for  xj/4,  is  particularly  good.  The  second  neighbourhood  of  the  quartet  t/^4  is  therefore 
defined  to  consist  of  the  seven  magnitudes  (38)  and  (39). 

The  variance  of  the  distribution  (41)  may  be  greater  than  or  less  than  that  of  (28).  If  B, 
(29),  is  fixed  then,  in  the  favourable  cases  that  the  three  magnitudes  (39)  are  all  large  or 
all  small  the  variance  of  (41)  will  be  smaller  than  that  of  (28)  and  the  estimate  for  xj/^ 
given  by  (41)  will  then  be  more  reliable  than  that  given  by  (28).  Thus  the  gain  in  going 
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from  the  first  to  the  second  neighbourhood  is  that  the  potential  for  obtaining  a 
distribution  with  a  small  variance  is  increased;  (although  it  should  be  emphasized  that 
this  potential  is  not  necessarily  realized  in  every  given  case);  in  short  it  becomes  possible 
to  find  a  more  reliable  estimate  for  1/^4.,  and  the  estimate  is  no  longer  restricted  to  be  zero 
but  may  instead  lie  anywhere  in  the  interval  (0,  n). 

The  formulation  of  the  neighbourhood  principle  has  stimulated  the  rapid  develop- 
ment  of  the  probabilistic  theory  of  the  structure  invariants  and  seminvariants.  Thus, 
with  the  derivation  of  conditional  probability  distributions  of  the  four-phase  structure 
invariants,  assuming  as  known  the  magnitudes  in  the  third  and  higher  neighbourhoods, 
the  probabilistic  theory  of  quartets  is  now  reasonably  complete,  at  least  for  the  space 
groups  PI  and  PI  (Hauptman  1977a,  b,  c).  The  theory  of  quintets  and  sextets  has  been 
initiated  (Giacovazzo  1976c;  Hauptman  1977;  Fortier  et  al  1977a,  b,  c;  Hauptman  et  al 
1977a,  b),  and  a  beginning  has  been  made  on  the  probabilistic  theory  of  the  structure 
seminvariants  for  selected  space  groups  (Hauptman  1979,  1980;  Burla  et  al  1981; 
Cascarano  et  al  1982;  Busetta  et  al  1980;  Giacovazzo  1977, 1979, 1980a,  b,  c;  Giacovazzo 
et  al  1979;  Giacovazzo  and  Vickovic  1980).  Not  only  have  these  developments  been 
heavily  dependent  on  the  neighbourhood  principle  but  the  results  obtained  confirm 
conclusively  the  validity  of  the  neighbourhood  principle  for  the  structure  invariants 
and  structure  seminvariants  in  general. 

7.  The  extension  concept:  Estimating  the  structure  seminvariants 

By  embedding  the  structure  seminvariant  T  and  its  symmetry  related  variants  in 
suitable  structure  invariants  Q  one  obtains  the  extensions  Q  of  the  seminvariant  T. 
Owing  to  the  space  group  dependent  relations  among  the  phases,  T  is  related  in  a 
known  way  to  its  extensions.  In  particular  the  neighbourhoods  of  T  are  defined  in 
terms  of  the  neighbourhoods  of  its  extensions.  The  process  will  be  illustrated  in  some 
detail  for  the  three-phase  structure  seminvariant  in  PI  which  serves  as  the  prototype  for 
the  structure  seminvariants  in  general,  in  all  space  groups,  noncentrosymmetric  as  well 
as  centrosymmetric. 

7.1  The  three-phase  structure  seminvariant  T  in  PI 

7.1a.  The  three-phase  structure  seminvariant.  The  linear  combination  of  three  phases 


T  —  <P  4>  ({>  i_, 


(75) 


is  a  structure  seminvariant  in  PI  if  and  only  if  the  three  components  of  the  reciprocal 
lattice  vector  H  +  K  +  L  are  even.  Then  the  components  of  the  eight  reciprocal  lattice 
vectors  5(  +  H  +  K  ±  L)  are  all  integers. 

7.1b.  The  extensions  of  T.  Following  Hauptman  (1977b,  1978;  but  see  also 
Giacovazzo,  1977,  for  an  equivalent  procedure)  one  embeds  the  three-phase  structure 
seminvariant 


^  0K+ 

and  its  three  symmetry  related  variants 
Pi  —  (p 4*  L' 


(76) 


(77) 
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^3  =  </*H+ </’k+ (^9) 

in  the  respective  quintets 

Co  —  0 -^(H  +  K  +  L)+ -i(H  +  K  +  L)»  (^11) 

Cl  ~  ^1  +  </* -i(-H  +  K+L)+ -i(-H  +  K  +  L)>  (^1) 

C2  =  ^2  +  0 -i(H-K  +  L)+  0-i(H-K  +  L)>  (^2) 

C3  ~  ^3  +  0 -i(H  +  K-L)+ 0 -i(H  +  K-L)-  (^3) 


In  view  of  (76)-(79)  and  the  space  group-dependent  relationships  among  the  phases,  it  is 
readily  verified  that  (80)  to  (83)  are  five-phase  structure  invariants  (quintets)  and 


^  —  Co  —  Cl  —  Qi  —  Qi- 


(84) 


Thus  the  probabilistic  theory  of  the  three-phase  structure  seminvariant  T  is  reduced  to 
that  of  the  quintets,  which  is  well  developed.  In  particular  the  neighbourhoods  of  T  are 
defined  in  terms  of  the  neighbourhoods  of  the  quintet. 


7.1c.  The  first  two  neighbourhoods  of  the  extensions.  Clearly  only  four  of  the  five  “main 
terms”  (see  §6.4a  and  Schenk  1975;  Hauptman  1977a)  of  the  special  quintet  Qo  (80)  are 
distinct.  The  first  neighbourhood  of  Qq  is  accordingly  defined  to  consist  of  the  four 
magnitudes 

I^hI’  |^kI>  l'^i(H  +  K  +  L)|-  (8^) 


Referring  again  to  §6.4  it  is  readily  verified  that  only  seven  of  the  ten  “cross-terms”  of 
the  special  quintet  Qq  are  in  fact  distinct.  Thus  the  second  neighbourhood  of  Qq  is 
defined  to  consist  of  the  four  main  terms  (85)  and  the  seven  cross-terms 


'i(- 


"H  +  K|> 


H  +  K  +  L)|> 

E 


K  +  L|’ 


i(H-K  +  L)|> 

|^l+h|!  I 


'i(H  +  K-L)l’ 


'H  +  K  +  Ll’ 


(86) 


or  eleven  magnitudes  |  £  |  in  all.  (In  contrast,  the  second  neighbourhood  of  the  general 
quintet  consists  of  fifteen  magnitudes  |£|.) 

Not  only  does  §6.4  serve  to  identify  the  first  two  neighbourhoods  of  the  special 
quintet  Qq  but  table  1  shows  in  a  qualitative  way  what  the  relationship  between  Qq  and 
the  magnitudes  |  £  |  in  its  second  neighbourhood  must  be.  Thus  table  1  leads,  by  suitable 
specialization,  to  rows  1-8  of  table  2.  Although  table  1  has  16  rows,  only  eight  of  these 
yield  entries  in  table  2  which  are  internally  consistent;  the  remaining  eight  rows  of  table 
1,  which  would  lead  to  contradictory  entries  in  table  2  (arising  from  the  fact  that  only 
seven  of  the  cross-terms  of  the  special  quintet  Qq  are  distinct),  are  therefore  omitted 
from  table  2.  Thus  the  16  rows  of  table  1  yield  only  rows  1-8  of  table  2  for  Qq. 

In  a  similar  way  it  may  be  shown  that  the  first  neighbourhood  of  the  special  quintet 
Qi  consists  of  the  four  magnitudes 


'H|> 


'Ll’  |^i(-H  +  K  +  L-)| 


(87) 


and  the  second  neighbourhood  of  the  four  main  terms  (87)  plus  the  additional  seven 
cross-terms 


'i(H-K  +  L)|’  |■^i(H  +  K-L)|’ 


l^i(H  +  K  +  L)i 
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-h-kI’ 


'K  +  Lh 


'L-H|> 


-H  +  K  +  L|> 


(88) 


i.e.  eleven  magnitudes  l£|  in  all.  Now  the  16  rows  of  table  1  for  2  ^  lead  to  the  eight  rows 
9-16  of  table  2.  Particularly  noteworthy  is  the  fact  that  the  second  neighbourhoods  of 
Qo  and  overlap  and  a  main  term  of  one  becomes  a  cross-term  of  the  other. 

Table  2  is  now  completed  in  the  obvious  way,  rows  17-24  corresponding  to  Q2  and 
rows  25-32  to  Q^,.  Again  the  overlap  of  the  second  neighbourhoods  is  to  be  noted,  and 
the  fact  that  main-terms  of  one  quintet  appear  as  cross-terms  in  others  is  particularly 
noteworthy  (because  of  the  implication  that,  in  order  to  arrive  at  a  proper  definition  of 
the  first  neighbourhood  of  T,  it  is  necessary  to  implicate  the  second  neighbourhoods  of 
its  extensions). 

Certain  rows  of  table  2  are  mutually  contradictory  and  others  are  mutually 
reinforcing.  By  combining  those  rows  of  table  2  which  are  mutually  reinforcing,  (e.g. 
rows  2  and  10  of  table  2  give  row  2  of  table  3)  one  obtains,  in  view  of  (84),  table  3  which 
leads  directly  to  the  first  neighbourhood  of  the  structure  seminvariant  T,  as  shown  next. 

7.1.d.  The  first  neighbourhood  ofT.  The  first  seven  entries  (under  magnitudes  |£|)  in 
rows  2,  8, 1 1,  and  20  of  table  3  are  identical,  thus  leading  to  row  2  of  table  4.  In  a  similar 
way  all  rows  of  table  4  are  obtained.  If  one  makes  the  definitions 

h  =  i(H  +  K  +  L),  hi  =i(-H  +  K  +  L), 

h2  =  i(H-K  +  L),  h3-i(H  +  K-L),  (89) 


Table  4  shows  that,  on  the  assumption  that  all  three  magnitudes 


I-^hI  ~  -^1’  I-^kI  ~  1^l|  ~ 


are  large,  then  T  ss  0  if  all  four  magnitudes 


(90) 


=  ^  |£hj  =  ''2,  |■Eh3H^3 


(91) 


are  large;  but  if  precisely  two  of  the  four  magnitudes  (91)  are  large  and  the  remaining 
two  are  small,  then  T  ^  n.  Thus  the  first  neighbourhood  of  T  is  defined  to  consist  of  the 
seven  magnitudes  (90)  and  (91),  i.e.,  as  it  turns  out,  the  set-theoretic  union  of  the  first 
neighbourhoods  of  all  of  its  extensions.  For  the  three-phase  structure  seminvariant  in 
PI  the  “favourable  cases”  of  the  neighbourhood  principle  are  defined  by  the  entries  of 
table  4,  viz.  the  three  magnitudes  (90)  are  large  and  either  all  four  magnitudes  (91)  are 
large,  in  which  case  P  «  0,  or  precisely  two  of  the  magnitudes  (91)  are  large  and  the 
remaining  two  are  small,  in  which  case  T  n. 


7.1e.  The  conditional  probability  distribution  P-  of  the  three-phase  structure  seminvari¬ 
ant  T,  given  the  seven  magnitudes  in  its  first  neighbourhood.  Suppose  that  a  crystal 
structure  in  PI  is  fixed  and  that  the  seven  non-negative  numbers  Pj,  ^2?  £3;  P  '"i » ''2>  ^3 
are  also  specified.  Denote  reciprocal  space  by  W  and  by  WxWxW  the  three-fold 
Cartesian  product  which  consists  of  all  ordered  triples  (H,  K,  L)  of  reciprocal  vectors. 
Suppose  finally  that  (H,  K,  L)  is  the  primitive  random  variable  which  is  assumed  to  be 
uniformly  distributed  over  the  subset  of  Wx  WxW  defined  by  (90),  (91),  and  the 
condition  that  the  components  of  H  +  K  -f  L  be  even.  Then  the  structure  seminvariant  T 
[equation  (75)],  as  a  function  of  the  primitive  random  variable  (H,  K,  L),  is  itself  a 
random  variable.  Denote  by  P^  or  P“  the  conditional  probability,  given  the  seven 
magnitudes(90)and(91),  that  PbeO  or  7T,  respectively,  or,  equivalently,  that  cosP  —  -)- 1 
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Table  4.  The  probable  value  of  the  structure  seminvariant  T,  given  the  seven  magnitudes  in  its 
first  neighbourhood;  L  means  large;  S  means  small;  obtained  by  combining  reinforcing  rows 
from  the  first  seven  columns  (under  magnitudes  |E|)  of  table  3. 


Row 

Derived  from 
rows  of 
table  3 

T 

H 

K 

(H  +  K  +  L) 

Magnitudes  |  E  | 
i(-H  +  K  +  L) 

i(H-K  +  L) 

i(H  +  K-L) 

1 

1 

0 

L 

L 

L 

L 

L 

L 

L 

2 

2,8,11,20 

n 

L 

L 

L 

L 

L 

S 

S 

3 

3,9,15,21 

n 

L 

L 

L 

L 

S 

L 

S 

4 

4,10,19,22 

n 

L 

L 

L 

L 

S 

S 

L 

5 

5,12,18,23 

n 

L 

L 

L 

S 

L 

S 

L 

6 

6,13,16,24 

n 

L 

L 

L 

S 

L 

L 

S 

7 

7,14,17,25 

n 

L 

L 

L 

S 

S 

L 

L 

or  —  1,  respectively.  The  formula  for  P-,  the  major  result  of  this  section,  is  simply 
(Hauptman  1980) 

P^  =  ~Z^,  +Z-,  (92) 

K 


Z±  =  exp(±n  X  exp(H/±), 


(93) 


2aV^ 


i6al  -  6(T2(T3<T4  +  (^2(^5)  (f  +ri-^ri  +  rj) 


.9/2 


(2(7^  -3<T2(T3(T4 Talas) 


K  j  R2R3, 


+'72»/3>'2^3) 


K  J  r  (2a  1  —  (T2^4)^2^3 


<^2 


T(^2^'‘2  +  '73'/i''3''i) 
+  ('73^''3  +  '7i»/2''i^2) 


—  3/2^2  3(2o'3 

oy  02 


1 


-^3/2 
®  2 


JR3  3  (2a3  —  a2a4)Kji?2 


(94) 


(95) 


where  the  summands  of  (93)  consist  of  the  eight  exponentials  obtained  by  permitting 
each  of  >7 1 ,  >7 2 .  fy  3  to  take  on  the  values  -f  1  or  —  1  independently  in  (95);  otherwise  upper 
(lower)  signs  in  (92)-(95)  go  together. 


7.  If.  The  discriminant  of  T.  Just  as  the  discriminant  of  the  quintet,  a  polynomial  in  the 
15  magnitudes  of  the  second  neighbourhood,  serves  to  identify  and  estimate  those 
quintets  having  the  extreme  value  0  or  n,  so  here  there  exists  a  polynomial  in  the  seven 
magnitudes  constituting  the  first  neighbourhood  of  T,  the  discriminant,  extreme  values 
of  which  serve  to  estimate  reliably  the  value  of  T.  The  discriminant  A  of  the  three-phase 
structure  seminvariant  in  PI  is  defined  by 
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--^{2ol-3a20iO^  +  alas)\RiR2Ri. 


(96) 


The  applications  also  show  that  extreme  values  of  the  discriminant  A  are  well  correlated 
with  the  values  of  T  in  the  sense  that  T  k  Oorn  according  as  A  >  0  or  A  0  respectively. 
Thus,  as  has  been  observed  many  times  previously  (e.g.  Fortier  and  Hauptman  1977c) 
the  discriminant  proves  to  be  a  useful  and  readily  computable  indicator  of  the  value  of 
the  seminvariant. 

Inspection  of  (96)  shows  that,  on  the  assumption  that  the  three  magnitudes  (90)  are 
large,  A  >  0  if  all  four  magnitudes  (91)  are  large;  but  A  0  if  precisely  two  magnitudes 

(91)  are  large  and  the  remaining  two  are  small.  In  short  T  «  0  or  rr  in  the  respective 
cases,  and  the  qualitative  prediction  of  table  4  is  in  fact  borne  out.  Inspection  of  the 
distribution,  (92)-(95),  leads  to  the  same  conclusion,  but  in  a  less  transparent  way. 

Finally,  it  is  not  difficult  to  show  that  if  one  of  the  magnitudes  (91)  is  outside  the 
observed  range,  then  it  is  to  be  replaced  by  unity,  the  average  value  of  |£|^,  in  (96)  or 

(92) -(95)  (see,  for  example,  Giacovazzo  1975;  Heinerman  et  al  1977). 

7.1g.  Concluding  remarks.  In  the  present  section  the  conditional  probability  distri¬ 
bution  of  the  three-phase  structure  seminvariant  T,  given  the  seven  magnitudes  in  its 
first  neighbourhood,  has  been  found  for  space  group  PI.  The  distribution  yields  a 
reliable  estimate  (0  or  tt)  for  T  in  the  favourable  case  that  the  variance  of  the  distribution 
happens  to  be  small.  Because  the  structure  seminvariants  are  available  in  much  larger 
numbers  than  the  structure  invariants,  it  is  anticipated  that  the  results  described  here 
will  find  important  application  in  the  determination  of  real  structures,  and  some  initial 
calculations  confirm  this  expectation. 

Of  particular  importance  is  the  analysis  leading  to  the  definition  of  the  first 
neighbourhood  given  in  §§7.1b  to  7. Id.  This  work  has  been  written  in  such  a  way  as  to 
permit  easy  generalization  to  the  higher  order  structure  seminvariants,  and  the  final 
results  for  the  four-  and  five-phase  structure  seminvariants  in  PI  have  been  derived.  It  is 
expected  also  that  the  work  described  here  will  serve  as  a  useful  guideline  for  the 
derivation  of  analogous  distributions  in  other  space  groups,  noncentrosymmetric  as 
well  as  centrosymmetric. 

8.  Direct  methods  and  isomorphous  replacement 

8.1  Introduction 

Direct  methods  are  routinely  used  nowadays  for  the  solution  of  small  molecular 
structures,  i.e.  those  containing  fewer  than  some  100  independent  non-hydrogen  atoms 
per  unit  cell.  For  the  solution  of  macromolecular  structures  on  the  other  hand,  the 
method  of  isomorphous  replacement  is  universally  used.  One  naturally  anticipates  that 
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the  ability  to  combine  the  two  techniques  would  yield  methods  more  powerful  than 
either.  This  fusion  of  the  two  techniques  is  described  here,  and  the  initial  applications 
(Hauptman  1982a;  Hauptman  et  al  1982)  strongly  suggest  that  the  anticipated  gain  in 
power  is  in  fact  realized. 

8.2  The  normalized  structure  factors 

If fj  and  gj  denote  atomic  structure  factors  for  a  corresponding  pair  of  isomorphous 
structures  in  PI,  then  the  respective  normalized  structure  factors  E^^^Lnd  G^are  defined 
by 

£h  =  £h  exp  ((0h)  =  All  Z  fj  exp  i2niH-Tj),  (97) 

a  20  j  =  l 

Gh  =  |Gh|  exp  (iiAh)  =  Aji  Z  dj  exp  (IniH  ■  rf,  (98) 

“02  j  =  1 

where 

=  Z  fjdf  (99) 

j=  1 

some  of  the  ffs  (or  gfs)  may  be  zero  (or  negative  in  the  neutron  diffraction  case)  and  is 
the  position  vector  of  the  atom  labelled  j.  This  formulation  includes  not  only  the  case  of 
two  strictly  isomorphous  structures  but  also,  for  example,  the  special  case  that  the  / 
structure  is  a  native  protein  and  the  g  structure  a  heavy-atom  isomorphous  derivative, 
as  well  as  the  case  that  one  or  both  sets  of  data  are  obtained  by  neutron  diffraction.  The 
fj  and  gj  are  zero-angle  atomic  scattering  factors  and  are  therefore  equal  to  the  atomic 
numbers  Zj  in  the  x-ray  diffraction  case  but  may  be  negative  in  the  neutron  diffraction 
case. 


8.3  The  structure  invariants 

If  H,  K,  and  L  are  reciprocal  lattice  vectors  satisfying 

H  +  K  -f  L  =  0  (100) 

then,  in  sharp  contrast  to  the  non-isomorphous  case,  there  exist  eight  different  kinds  of 
three-phase  structure  invariants: 

Co  ~  ^L’  (iOi) 

Cl  =  </>H  + •Al’  Cl  =  0hT CT  =  +  (102) 

C2  =  <^h+'Ak  +  *Al’  C'2  =  •Ah+^/’k+'Al’  C'i  =  <AhT'Ak+^l>  (103) 

C3  =  (Ah  +  '/'k+'/'l-  (104) 


8.4  The  first  neighbourhood  of  each  of  the  three-phase  structure  invariants  (101)-(104) 

Again  in  contrast  to  §6.  la,  the  first  neighbourhood  of  each  of  the  three-phase  structure 
invariants  (101)-(104)  is  now  defined  to  consist  of  the  six  magnitudes 
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Thus  estimates,  for  example,  of  even  the  “pure”  structure  invariants  Co  [equation  (101)] 
are  made  to  depend  on  the  values  of  six  magnitudes,  (105)  and  (106),  rather  than  on  the 
three  magnitudes  (105)  alone,  as  in  the  non-isomorphous  case. 

8.5  The  conditional  probability  distribution  of  the  three-phase  structure  invariant 

It  is  assumed  that  an  isomorphous  pair  of  structures  in  PI  with  atomic  position  vectors 
Tj,  r2  .  .  .  ,  Tf^jis  fixed  and  that  the  six  non-negative  numbers  Pj,  R2,  P3,  Sj,  S2,  are 
specified.  Denote  reciprocal  space  by  IT  and  by  WxWx  IT  the  three-fold  cartesian 
product  consisting  of  all  ordered  triples  (H,  K,  L)  of  reciprocal  vectors.  Suppose  that  the 
primitive  random  variable  is  the  triple  (H,  K,  L)  which  is  assumed  to  be  uniformly 
distributed  over  the  subset  of  IT  x  IT  x  IT  defined  by  (100)  and 


I^hI  ~  -^1’ 

I-^kI  ~  ■^2,  I^lI  ■“ 

(107) 

I^hI  “  ^1’ 

I^kI  ~  “^2,  I^lI  ~  “^3, 

(108) 

where  the  normalized  structure  factors  E  and  G  are  defined  in  (97)-(99).  Then  the  eight 
three-phase  structure  invariants,  Co>  Ci»  Ci  •  ■  •  C3  [equations  (101)-(104)],  as 
functions  of  the  primitive  random  variable  (H,  K,  L),  are  themselves  random  variables. 
The  conditional  probability  distribution  of  Co,  for  example,  assuming  as  known  the  six 
magnitudes  \E^^\,  |^h[  |Gk|>  I^lI  [equations  (107)  and  (108)]  is  given  by 

(Hauptman  1982a) 

P 0(^0 1^1,  ^2,  ^3!  ,  '^2,  ‘^3)  ~  Tr~  ('^o  Qq)’  (109) 

where 

Ke  =  27r/o(/io),  (110) 

and  Aq  in  turn  is  explicitly  expressible  in  terms  of  the  atomic  scattering  factors  fj,gj, 
;  =  1,  2, .  .  . ,  iV,  and  the  six  magnitudes  |jEh|,  jp^l,  \El\,  |Gh|,  |G,j|,  |Gl|  [equations  (107) 
and  (108)].  Similar  formulas  hold  for  Pi(Di|Pi,  Rj,  R3,  S2,  S3),  the  conditional 

probability  distribution  of  Ci,  assuming  as  known  the  six  magnitudes  |£„|,  |£k|, 

I^hI’  I^kI’  I^lI  [equations  (107)  and  (108)],  etc.  (109)  is  seen  to  be  identical  in  form  to 
(22).  However  the  parameter  Aq  of  (109),  in  sharp  contrast  to  the  parameter  A  of  (22), 
may  be  positive  or  negative.  Thus  (109)  has  a  single  maximum  in  the  interval  (0,  In)  and 
this  maximum  occurs  at  0  or  tt  according  to  /Iq  >  0  or  Aq  <  0.  Again,  the  larger  the 
value  of  I  Aq  I  the  smaller  is  the  variance  of  the  distribution.  Thus,  when  |  Aq  |  is  targe  one 
obtains  the  reliable  estimate 

Co  ~  0  or  Co  ~  (111) 

according  as 

Ao  ^  0  or  Ao  0,  (1 12) 

respectively.  Just  as  (24)  has  proved  to  be  of  great  importance  in  the  development  of  the 
traditional  techniques  of  direct  methods,  it  is  natural  to  assume  that  the  estimates  (111), 
and  the  like  estimates  for  the  remaining  invariants  ( 1 0 1 )-( 1 04),  will  play  a  similar  role  in 
the  development  of  techniques  combining  the  traditional  direct  methods  and 
isomorphous  replacement.  In  particular,  one  anticipates  the  existence  of  a  generalized 
tangent  formula  which  takes  into  account  the  estimates  (111). 
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9.  Direct  methods  and  anomalous  dispersion 

9.1  Introduction 

It  has  been  known  for  a  long  time  (Peerdeman  and  Bijvoet  1956;  Ramachandran  and 
Raman  1956;  Okaya  and  Pepinsky  1956)  that  the  presence  of  one  or  more  anomalous 
scatterers  facilitates  the  solution  of  the  phase  problem;  and  some  recent  work  (Kroon  et 
al  1977;  Henerman  et  al  1978)  employing  Bijvoet  inequalities  and  the  double  Patterson 
function,  leads  in  a  similar  way  to  estimates  of  the  sines  of  the  three-phase  structure 
invariants.  This  work  strongly  suggests  that  the  ability  to  integrate  the  techniques  of 
direct  methods,  in  particular  the  recent  advances  in  the  mathematical  formalism,  with 
anomalous  dispersion  would  lead  to  improved  methods  for  phase  determination.  The 
fusion  of  these  techniques  is  described  here  and  the  anticipated  improvement  is  in  fact 
realized  (Hauptman  1982b).  Not  only  do  the  new  formulas  lead  to  improved  estimates 
of  the  structure  invariants  but,  more  important  still,  because  the  distributions  derived 
here  are  unimodal  in  the  whole  interval  (0,  2n),  the  two-fold  ambiguity  inherent  in  all 
the  earlier  work  is  removed.  It  is  believed  that  this  resolution  of  the  two-fold  ambiguity 
results  from  the  ability  now  to  make  use  of  the  six  individual  magnitudes  in  the  first 
neighbourhood  of  the  structure  invariant  and  the  avoidance  of  explicit  dependence  on 
the  Bijvoet  differences;  the  explicit  use  of  the  Bijvoet  differences,  as  is  done  in  all 
previous  work,  leads  apparently  to  a  loss  of  information  resulting  in  a  two-fold 
ambiguity  in  estimates  of  the  structure  invariants.  Since,  in  the  presence  of  anomalous 
scatterers,  the  observed  intensities  are  known  to  determine  a  unique  enantiomorph,  and 
therefore  unique  values  for  all  the  structure  invariants,  formulas  of  the  kind  described 
here  should  not  be  unexpected;  nevertheless  not  even  their  existence  appears  to  have 
been  anticipated. 

9.2  The  normalized  structure  factors 

In  the  presence  of  anomalous  scatterers  the  normalized  structure  factor 

£„  =  l^H  I  exp  (/(/)„  X  (113) 

is  defined  by 

^  ;7T72  TfjH  exp  (27ciH  •  r^)  (114) 

“h  j  =  1 

=  rT72  I  |^Hlexp[i(^^.H  +  27cH  r,)],  (115) 

j  =  1 

where 

fjH  ~  |.^h|  exp(j^^H),  (116) 

is  the  (in  general  complex)  atomic  scattering  factor  (a  function  of  |  H|  as  well  as  of  j)  of 
the  atom  labeled  j,  is  its  position  vector,  N  is  the  number  of  atoms  in  the  unit  cell,  and 

(117) 

j  =  1 

For  a  normal  scatterer,  =  0;  for  an  atom  which  scatters  anomalously,  f  0. 
Owing  to  the  presence  of  the  anomalous  scatterers  the  atomic  scattering  factors ,  as 
functions  of  sin  0/2,  do  not  have  the  same  shape  for  different  atoms,  even  ap- 
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proximately.  Hence  the  dependence  of  the/^H  on  |H|  cannot  be  ignored,  in  contrast  to 
the  usual  practice  when  anomalous  scatterers  are  not  present.  For  this  reason  the 
subscript  H  is  not  suppressed  m  the  symbols  and  [equation  (117)]. 

9.3  The  structure  invariants 

It  will  be  assumed  throughout  that  H,  K,  L  are  fixed  reciprocal-lattice  vectors  satisfying 

H  +  K  +  L  =  0.  (118) 

Owing  to  the  breakdown  of  Friedel’s  law  there  are  now  eight  distinct  three-phase 
structure  invariants: 


Co  ~  T 

(119) 

Cl  ~  "b  L  </>L’ 

(120) 

C2  ~  ~  T  ft, 

(121) 

C3  “  0H  "b  fk  “  ft’ 

(122) 

Cff  =  fH  +  fK  +  ft’ 

(123) 

1,1=  ~  fuT  fig:T  fi^, 

(124) 

kj  =  fH~fK  +  fL’ 

(125) 

Ct  =  fH  +  fK~fv 

(126) 

9.4  The  first  neighbourhood  of  each  of  the  three-phase  structure  invariants  (1 19F(  126) 

In  analogy  with  §8.4,  the  first  neighbourhood  of  each  of  the  three-phase  structure 
invariants  (119F(126)  is  defined  to  consist  of  the  six  magnitudes 


1  1’ 

1  F  1 

1  1  ’ 

\Et\’ 

(127) 

1  F-i 

1  h 

\Et:\’ 

(128) 

which,  because  of  the  breakdown  of  Friedel’s  Law,  are  not,  in  general,  equal  in  pairs. 

9.5  The  conditional  probability  distribution  of  the  three-phase  structure  invariant, 
assuming  as  known  the  six  magnitudes  in  its  first  neighbourhood. 

Fix  the  reciprocal-lattice  vectors  H,  K,  and  L,  subject  to  (118).  Suppose  that  the  six  non¬ 
negative  numbers  Ri,  Rj,  R^,  Rj  Rj,  and  R^  are  also  specified.  Define  the  A'^-fold 
Cartesian  product,  V,  to  consist  of  all  ordered  /V-tuples  (fj,  r2,  .  .  .  ,  r^),  where  Tj,  r2, 
.  .  .  ,  fjy  are  atomic  position  vectors.  Suppose  that  the  primitive  random  variable  is  the 
Af-tuple  (fi,  r2,  .  .  .  ,  r^)  which  is  assumed  to  be  uniformly  distributed  over  the  subset 
of  V  defined  by 


|£hI 

tij 

II 

II 

hJ 

r 

=  «3, 

(129) 

|£ni 

—  Ej’  l^ltl 

II 

(130) 

where  the  normalized  structure  factors  E  are  defined  by  (114).  Then  the  eight  structure 
invariants 


Cj,  Cj,  j  -  0,  1,  2,  3, 


(131) 
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(119)-(126),  as  functions  of  the  primitive  random  variables  (rj,  T2,  ,  r^^),  are 

themselves  random  variables.  Denote  by 


(132) 


the  conditional  probability  distribution  of  the  three-phase  structure  invariant  Cj,  j  =  0. 
1,  2,  3,  0,  1,  2,  3,  given  the  six  magnitudes  (129)  and  (130)  in  its  first  neighbourhood. 
Then  the  major  result  is  that  the  distributions  have  the  pure  exponential  form 

(Hauptman  1982b): 


(133) 


where  the  parameters  Kj,  Aj  and  coj  are  expressible  in  terms  of  the  complex  scattering 
factors presumed  to  be  known,  and  the  observed  magnitudes  |  £h  U  I>  I 
\Err\,\  £^|,  No  prior  knowledge  of  the  positions  of  the  anomalous  scatterers  is 
needed,  nor  is  it  required  that  the  anomalous  scatterers  be  identical.  Since  the  K/s  and 
A/s  are  positive,  the  maximum  of  (133)  occurs  at  =  (Oj.  Hence  when  the  variance  of 
the  distribution  is  small,  i.e.  when  Aj  is  large,  one  obtains  the  reliable  estimate 


(134) 


Cj  =  coj,  j  -  0,  1,  2,  3,  0,  1,  2,  3 


for  the  structure  invariant  Cj,  [(1 19)-(126)].  Once  again  a  generalized  tangent  formula, 
based  on  the  estimates  (134)  for  the  invariants  Cj,  is  readily  formulated  and  is  expected 
to  yield  estimates  for  the  individual  phases  in  the  case  that  anomalous  scatterers  are 
present. 

10.  Concluding  remarks 

The  basic  principles  of  the  direct  methods  of  x-ray  crystallography  have  been  described. 
The  fundamental  role  of  the  structure  seminvariants,  that  they  serve  to  link  the 
observed  magnitudes  with  the  desired  phases  of  the  structure  factors,  has  been 
emphasized.  The  neighbourhood  principle,  which  affirms  the  intimate  relationship 
which  exists  between  a  structure  seminvariant  T  and  one  or  more  small  sets  of 
magnitudes  |  E  |,  the  neighbourhoods  of  T,  has  been  formulated.  Reduction  of  the 
probabilistic  theory  of  the  structure  seminvariants  T  to  that  of  the  structure  invariants 
has  been  accomplished  by  embedding  T  and  related  seminvariants  in  suitable  structure 
invariants,  the  extensions  of  T.  Estimates  of  the  structure  invariants  and  seminvariants 
T  are  obtained  by  means  of  appropriate  conditional  probability  distributions,  assuming 
as  known  the  magnitudes  jEl  in  the  neighbourhoods  of  T.  Finally,  the  traditional 
techniques  of  direct  methods  have  been  combined  with  isomorphous  replacement  and 
anomalous  dispersion,  thus  opening  up  the  possibility  that  the  existing  machinery  of 
direct  methods  may  be  carried  over  without  essential  change  to  facilitate  the  solution  of 
macromolecular  structures  via  isomorphous  replacement  and  anomalous  dispersion. 
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The  efficient  generation  of  phase  sets  by  the  random  approach 


K  HENDERSON  and  M  M  WOOLFSON 

Department  of  Physics,  University  of  York,  Heslington,  York  YOl  5DD,  UK 

Abstract.  It  is  shown  that  a  large  number  of  plausible  sets  of  phases  satisfying  triple-phase 
relationships  may  be  found  very  efficiently  and  cheaply.  Sets  of  phases  satisfying  the  original 
set  of  phase  relationships  (<^s)  may  be  combined  with  ‘solutions’  (i^s)  for  the  set  of  relationships 
with  any  translationally-derived  constant  angles  removed.  An  application  to  a  known  structure 
demonstrates  the  validity  of  the  technique  and  a  theoretical  interpretation  is  given  for  a  case 
where  a  true  solution  results  from  a  (f>  and  tj/  combination. 

Keywords.  Direct  methods;  phase  sets;  random  approach 


1.  Introduction 

An  individual  triple-phase  relationship,  with  phases  expressed  in  cycles,  may  be  written 
as 

</)p  ±  ±  +  h  S-  /I,  (1) 

where  the  constant  angle  b  may  arise  from  translational  symmetry,  ^5;  means  ‘is 
probably  close  to’  and  n  is  some  integer.  A  complete  set  of  such  relationships  may  be 
written  in  matrix  notation  as 

K{\(f) +  h)  ^  Kn,  (2) 

where  the  elements  of  the  diagonal  matrix  k  are  the  usual  k  values,  2N“2  \EpE^E^  \  for 
an  equal-atom  structure. 

In  the  YZARC  method,  developed  from  ideas  given  by  Baggio  et  al  (1978),  initial 
random  values  are  assigned  to  about  100  phases  which  are  subsequently  refined  by  an 
iterative  process  to  satisfy  equation  (2).  Some  50^-100  sets  of  refined  phases  are  then 
extended  by  the  tangent  formula  and  thereafter  treated  and  evaluated  in  the  same  way 
as  sets  of  phases  generated  by  multan  80  (Main  et  al  1980). 

The  YZARC  method  has  been  extremely  successful  in  solving  structures  with  up  to  100 
atoms  or  so  in  the  asymmetric  unit.  However  it  transpires  that,  in  general,  the  larger  the 
structure  the  greater  is  the  number  of  phase  sets  which  need  to  be  examined  in  order  for 
a  correct  one  to  appear.  For  a  small  protein  it  may  be  necessary  for  many  thousands  of 
phase  sets  to  be  generated  in  order  to  obtain  just  one  which  may  reveal  the  structure. 
This  would  be  very  expensive  in  computer  time  if  done  in  a  straigtforward  way  but,  as 
will  be  demonstrated,  an  economical  process  is  available. 


2.  ij/  sets 


Consider  a  set  of  equations 
kA\I/  =  Km 


3 


(3) 
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where  A  is  the  same  matrix  as  in  (2)  and  m  is  a  vector  whose  elements  are  close  to 
integers.  The  yzarc  process  may  be  used  to  find  i/^’s  which  satisfy  (3). 

A  (t>  solution  of  (2),  </>;,  and  a  ij/  solution  of  (3),  if/j,  are  combined  to  give 


A((/>,-  +  if/j)  +  b}  «  K:(ni  ±  m^). 


(4) 


Since  the  elements  of  n,  ±  will  be  close  to  integers  then  (pi  ±  ij/j  will  be  near  solutions 
of  (2).  If  there  are  P  (p  solutions  and  Q  \p  solutions  then  there  will  be  P{2Q  +  1)  available 
solutions  or  near-solutions  of  (2).  The  advantage  of  this  is  apparent  when  we  consider  a 
numerical  case.  With  P  —  Q  =  50  then  P{2Q  +  1)  =  5050  and  this  number  of  possible 
phase  sets  is  obtained  with  little  more  computer  time  than  is  required  to  generate  100 
sets  of  phases  with  yzarc. 

A  question  which  arises  is  the  following:  ‘For  each  (p  solution  is  there  a  ip  solution 
which  can  be  combined  with  it  to  give  an  almost-correct  set  of  phases?’ 

If  (p,  is  the  set  of  true  phases  then  for  a  particular  solution  of  (2),  <pi,  a  correction 
vector  would  be 

^4>i  =  4>t-<Pi- 

If  A</),  is  substituted  into  (3)  and  refined  and  if  the  refined  elements  are  close  to  their 
initial  values  (  ^  40*"  different,  say)  then  we  may  assume  that  there  is  an  obtainable  ip 
which  will,  combined  with  </>,,  give  a  substantially  correct  solution. 

Tests  with  several  structures  indicate  that  for  about  20%  of  (p  solutions  there  is  a 
potentially-available  \p  which  will  lead  to  a  structure  solution. 

3.  Figures  of  merit  for  small  sets  of  phases 

In  the  process  which  is  to  be  described  it  is  necessary  to  associate  a  figure  of  merit  (fom), 
a  measure  of  plausibility,  to  sets  of  100  or  less  phases.  It  is  well-known  that  the  foms 
used  in  multan  are  very  effective  but  these  are  applied  to  phase  sets  which  define  the 
structure  well,  in  the  sense  that  a  Fourier  synthesis  would  show  the  correct  structure,  or 
most  of  it,  quite  clearly. 

We  have  found  a  group  of  foms,  suitable  for  a  small  data  set,  which  can  be  expressed 
in  terms  of  the  quantities 


«h  ^  IZ'<h,h'exp{i(0h-</'h'  -</>h-h')}|- 

U' 


h 


(5) 


These  are; 


(i)  ABSFOM  =  Z|ah,|/Z 


(6) 


h 


h 


(ii)  Ri  =  Z|ah,.-K|/Z“h,.’ 


(7) 


h  h 


(iii)  R2  =  {Z(^“h..-«h)VZ“/../}^’ 


(8) 


h  b 


and  (iv)  MR  =  min  (a^/a^ 

h 


(9) 
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where  an  estimate  of  the  theoretically  expected  value  of  is 


~  Z  '^h.h'  (H,h')/^0  i^h,h'X 

h 

(10) 

(11) 

h  h 

and  s  =  X«h..«h/I«h,e" 

(12) 

h  h 

which  makes  i?2  a  minimum. 

These  foms  are  not  good  as  absolute  figures  of  merit  but  they  do  seem  to  work  quite 
well  as  ranking  figures.  In  a  large  number  of  experiments  it  has  been  found  that 
combining  a  if/  with  a  <f)  usually  lowers  absfom.  For  a  few  combinations  there  is  a  small 
increase  in  absfom. 

Since  a  promising  (t>±\l/  should  have  a  reasonably  large  absfom  (  ^  1,  say)  then  only 
those  <f>  solutions  for  which  absfom  ^  0-95  have  been  used  for  combining  with  ^s.  This 
rule  of  thumb  may  need  to  be  modified  in  the  light  of  further  experience. 


4.  A  trial 

The  idea  of  combining  (f>  and  if/  sets  was  tested  with  the  known  structure  of  cortisone, 
C21H28O5;  P2i2,2i;  a  =  10-040,  b  =  23-649,  c  =  7-784 A;  Z  =  4  (figure  1). 

For  a  normal  yzarc  run  100  sets  of  100  phases  were  refined  to  satisfy  equation  (2), 
expanded  to  204  phases  and  then  analysed  by  the  usual  mult  an  procedure.  Four  correct 
solutions  were  found — sets  46, 57, 59  and  98;  these  revealed  all  the  non-hydrogen  atoms 
of  the  structure. 

For  each  of  the  100  (f>s  (each  containing  100  phases)  from  the  yzarc  run  foms  were 
calculated.  There  were  then  derived  51  if/  sets  by  running  yzarc  with  (3).  Each  (ft  for 
which  absfom  ^  0-95  was  combined  with  each  ij/  solution  and  SL<f)±\l/  combination  was 
investigated  further  if  its  absfom  ^  0-95  and  if  its  Ri  was  smaller  than  that  for  4>  alone. 

Each  of  the  384  combinations  satisfying  these  criteria  was  extended  to  204  phases  by 
the  tangent  formula  and  then  subjected  to  the  usual  multan  figures  of  merit.  Of  the  384 
combinations  26  gave  the  entire  structure. 

For  each  of  the  original  <t>±tl/  combinations  (with  100  phases)  the  foms  (6)  to  (9)  were 
calculated  and  a  combined  figure  of  merit,  cfom  ,  produced  from  them  (as  is  done  in 


Figure  1.  A  molecule  of  cortisone. 
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Table  1.  The  26  combinations  giving  a  solution  for  cortisone 


Set 

0 

•A 

CFOM 

Set 

</> 

•A 

CFOM 

71 

14 

21 

3.30 

378* 

98 

-3 

2.67 

290’' 

59 

-32 

3.12 

212* 

46 

3 

2.58 

315 

71 

31 

2.94 

365 

92t 

-5oy 

2.55 

215* 

46 

-19 

2.92 

188 

43t 

-50v' 

2.55 

220* 

46 

24 

2.90 

189 

43t 

50V 

2.55 

219* 

46 

-24 

2.81 

211* 

46 

-  3 

2.54 

223* 

46 

-32 

2.80 

153 

31t 

50V 

2.54 

216* 

46 

19 

2.78 

366 

92t 

50V 

2.49 

270* 

57 

19 

2.77 

278* 

57 

51 

2.46 

269* 

57 

-19 

2.77 

152 

31t 

-50y 

2.46 

273* 

57 

-24 

2.73 

264* 

57 

-  3 

2.45 

274* 

57 

24 

2.70 

279* 

57 

32 

2.45 

381* 

98 

-32 

2.68 

265* 

57 

3 

2.34 

*  ^  trivial  with  all  elements  nearly  equal  zero. 
+  These  (^s  are  identical. 

i/'jo  has  all  elements  close  to  0  or  n. 


MULTAN).  Table  1  shows  the  26  solutions  in  ranking  order  of  cfom. 

Set  71  had  the  highest  cfom  of  all  the  384  combinations  produced  and  led  to  an  E 
map  of  exceptionally  high  quality.  It  is  also  worth  noting  that  set  265,  the  correct 
solution  with  the  lowest  cfom  was  161st  in  ranking  order  for  the  384  combinations  so 
that  all  the  correct  solutions  were  comfortably  contained  in  the  top  half  of  the  ranking 
order. 

5.  An  interpretation 

Set  71,  that  with  the  highest  cfom,  corresponded  to  the  combination 
Figures  of  merit  were: 

for  ABSFOM  =  10()0  Rj  =  0-220 

R2  =0-231  MR  =  0-059 

for  <1)14. +  11^21  ABSFOM  =  1-060  R,  =  0-201 
R,  =0-198  MR  =  0-290. 

It  can  be  seen  that  all  the  figures  of  merit  are  better  for  the  combination  than  for  (p  14 
alone.  This  type  of  behaviour  is  rare  and  in  our  limited  experience  seems  nearly  always 
to  indicate  that  a  correct  solution  has  been  found  by  combination. 

In  figure  2(a)  there  is  shown  a  peak-interpreted  E  map  for  (/),4  and  in  figure  2(b)  the 
corresponding  map  for  +  p2i-  It  is  clear  that  (P14.  is  showing  a  partially  correct 
fragment  of  the  molecule  but  displaced.  In  figure  2(b)  the  peaks  are  ringed 
corresponding  to  part  of  the  configuration  shown  in  figure  2(a).  The  coordinates  of  two 
corresponding  points  are  shown  and  it  is  clear  that  the  displacement  of  the  fragment  in 
figure  2(a)  is  approximately  (0-44,  0,  0). 

Let  us  assume  that  for  each  phase 


(13) 
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Figure  2.  a.  The  fragment  from  <^,4.  b.  The  molecule  from  <^,4 


where  (|),^^  is  the  true  phase.  Then  since 

£h  =  |£h!  exp  (/■(/), h),  (14) 

we  may  write 

£h  =  i  £h  I  exp  )  exp  {iij/.  (1 5) 

A  ‘true'  £  map  is  the  Fourier  transform  of  the  set  of  £(,.  Hence  it  is  the  convolution  of 
the  £  map  obtained  using  the  phases  (/>,  with  the  map  obtained  using  phases  if/ ,  and  unit 
amplitudes.  This  latter  map  we  may  call  the  error  map  (Silva  and  Viterbo  1980). 

The  error  map  calculated  from  021  has  as  its  largest  peaks; 


Height 

.V 

t' 

- 

3931 

0-4326 

0-0000 

0-0000 

2108 

0-4312 

0-0941 

0-8673 

1927 

0-4358 

0-0901 

0-1737 

1875 

0-4330 

0-0036 

0-6953 

1594 

0-0346 

0-0276 

0-3380 

We  can  recognise  the  highest  peak  as  the  shift  in  the  fragments  displayed  in  the  two 
parts  of  figure  2.  This  way  of  thinking  about  errors  in  sets  of  phases  generated  by  direct 
methods  can  be  most  productive  and  may  lead  to  even  more  efficient  processes  for 
generating  large  numbers  of  plausible  phase  sets. 
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6.  Conclusion 

The  tests  carried  out  with  cortisone  show  that  the  idea  of  combining  (f)  and  ij/  solutions 
is  basically  sound.  Some  questions  to  which  answers  are  needed  are: 

(i)  Can  we  recognise  and  eliminate  duplicate  0  and  ij/  sets? 

(ii)  Is  the  number  of  different  ij/  sets  fairly  small  and  if  so  how  do  we  find  all  of  them? 

(iii)  Can  we  find  even  better  figures  of  merit  for  small  sets  of  phases? 

For  really  complicated  structures  (even  small  proteins)  many  thousands  or  tens  of 
thousands  of  sets  of  phases  must  be  investigated.  While  it  seems  that  we  can  easily 
generate  many  sets  we  must  find  good  and  fast  ways  to  evaluate  their  plausibility. 
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An  assessment  of  criteria  of  Ht  in  Patterson  search 


C  E  NORDMAN 

Department  of  Chemistry,  University  of  Michigan,  Ann  Arbor,  Michigan  48109,  USA 

Abstract.  The  problem  of  reliably  detecting  a  known  set  of  n  vectors  of  weight  w,  embedded  in  a 
heavily  overlapped  Patterson  function  P,-  is  investigated  by  a  Monte-Carlo  simulation  based 
on  searches  of  computer-generated  random  number  sequences.  Several  formulations  of  the 
criterion  of  fit  were  compared.  All  were  found  to  improve  when  the  criterion  was  based  on  a 
subset  of  m  “worst  fitting”  vectors  as  judged  by  a  low  value  of  (PJWi).  The  best  criteria  were 
rr=  1  1 with  m»  (04-05)n,  Wj,with  mx  0-3n,  and  I-T=i(^iPi) 

with  m  X  0-ln.  In  each  case  the  detectability  of  the  embedded  vectors  W;  increases  with 
increasing  ujw)  in  relation  to  (t(N),  the  standard  deviation  of  the  overlaid  noise.  A  related 
simulation  of  a  Patterson  search  for  non-crystallographic  symmetry  shows  that  for  a  given  size 
of  the  non-crystallographically  symmetric  region,  the  detectability  increases  with  the  order  (2- 
fold,  6-fold,  12-fold)  of  the  symmetry. 

Keywords.  Patterson  search;  signal  detection;  Monte-Carlo  simulation. 


1.  Introduction 

Crystallographic  structure-solving  techniques  based  on  interpretation  of  the  Patterson 
function  typically  involve  computer-implemented  systematic  sampling  or  exhaustive 
search  of  the  Patterson  function.  The  manner  in  which  the  Patterson  function  is 
sampled  reflects  the  nature  of  the  available  a  priori  information  concerning  the 
unknown  crystal  structure. 

This  information  may  be  the  knowledge  of  the  structure  of  some  (rigid)  fragment 
present  in  the  molecule.  In  this  case  the  Patterson  function  is  sampled  at  points 
corresponding  to  the  set  of  vectors  within  this  fragment,  or  between  two  properly 
oriented  copies  of  the  fragment.  Alternatively,  the  available  information  may  be  the 
knowledge  that  the  molecule — perhaps  a  multisubunit  protein,  or  a  virus — possesses 
local  non-crystallographic  symmetry  of  a  particular  kind.  In  this  case  the  Patterson 
function  may  be  sampled  with  a  movable  “symmetry  grid,”  an  array  of  points  in 
spherical  polar  coordinates,  which  possesses  the  exact  point  group  symmetry  of  the 
molecules  (Nordman  1980a). 

A  recent  review  (Nordman  1980b)  describes  these  methods  in  greater  detail  and  gives 
examples  of  their  use  in  small-molecule  crystallography. 

In  either  type  of  search  a  criterion  of  fit,  or  “image-seeking  function,”  is  evaluated  at 
each  step  in  the  search.  Collectively,  these  values  constitute  a  “map,”  generally  in  three 
dimensions,  the  coordinates  being  angular  or  translational  depending  on  the  nature  of 
the  search.  Promising  orientations  or  translational  positions  of  the  search  object  are 
indicated  by  maxima,  or  minima,  in  the  map. 

In  small-molecule  problems,  where  the  rigid  fragment  constitutes  most  or  all  of  the 
molecule,  the  exact  formulation  of  the  criterion  of  fit  is  not  very  crucial.  Criteria  used  in 
such  cases  include  maximizing  the  sum  of  the  sampled  values  of  the  Patterson  function 
(Braun  et  al  1969),  or  minimizing  the  sum  of  the  squares  of  the  difference  between  the 
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fragment  Patterson  and  the  crystal  Patterson  at  all  points  where  the  former, 
unacceptably,  exceeds  the  latter  (Huber  1965). 

The  problem  of  making  the  search  as  discriminating  as  possible  was  first  considered 
by  Schilling  (1970).  He  showed  that  it  is  advantageous  to  sort  the  sampled  Patterson 
values  l\  and  the  weights  h’,  of  the  known  sampling  vectors,  in  order  of  increasing 
values  of  the  ratio  P,/vVj.  The  lower  the  value  of  this  ratio,  the  worse  is  the  fit.  that  is,  the 
more  poorly  is  the  fragment  vector  peak  Wj  accommodated  by  the  Patterson  value  P,. 
By  including  in  the  criterion  of  fit  only  those  (w^,  P,)  values  which  are  most 
discriminating,  as  indicated  by  low  Schilling  showed  that  more  reliable  search 

results  were  obtained.  The  “minimum  average”  is  defined  as 


(1) 


where  n  is  the  total  number  of  fragment  vectors  used,  and  m  is  the  size  of  the  subset 
having  the  m  lowest  values  of  P,/  h',.  This  criterion  of  fit  has  found  wide  use,  typically 
with  mm  —  01  -0  3. 

Another  reasonable  criterion  of  fit  is  the  sum  of  the  products  of  the  search  vector 
weights  vv'i  and  the  sampled  Patterson  values  P,.  This  function 


(2) 


I  =  I 


tends  to  be  high  at  the  correct  solution.  Recognizing  that  W;  represents  points  in  a 
“model”  Patterson,  P„,  and  replacing  the  sum  with  an  integral,  it  is  seen  that  (2)  is 
related  to  |  P„,(r)P(r)dr.  This  integral,  evaluated  as  a  sum  of  products  of  Patterson 
coefficients,  is  the  criterion  of  fit  used  in  reciprocal-space  search  methods,  for  example, 
the  widely  used  rotation  function  (Rossmann  and  Blow  1962). 

In  order  to  assess  the  potential  value  of  Patterson-space  search  techniques  in 
macromolecular  crystallography,  it  is  of  interest  to  examine  different  criteria  of  fit  in  the 
hope  of  finding  the  one  which  is  most  promising  in  the  unfavourable  case  of  a  very 
heavily  overlapped  Patterson  function.  It  has  recently  been  shown  (Nordman  and  Hsu 
1982)  that  a  Monte-Carlo  calculation  which  simulates  a  Patterson  search  can  be 
formulated  as  a  one-dimensional  problem  of  detecting  a  sequence  of  known  numbers 
embedded  in  a  longer  sequence  of  random  numbers.  On  the  basis  of  relatively  limited 
statistics  it  was  concluded  that  the  criterion  (1),  with  n  =  3(X),  m  =  50--100,  was  more 
successful  in  finding  the  “correct”  solution  than  criterion  (2). 

In  this  communication  a  slightly  modified  Monte-Carlo  calculation  is  used  to 
examine  several  criteria  of  fit  in  the  light  of  much  more  extensive  statistical  material. 
Also,  a  related  formulation  is  employed  in  a  Monte  Carlo  simulation  of  a  search  for 
local  non-crystallographic  symmetry. 

2.  Simulated  structure  search 

In  an  actual  Patterson  search  n  vectors  of  weight  vv,  scan  the  Patterson  function, 
returning,  at  each  point  in  the  search  a  set  of  values  P,,  i  =  1.  .  .  .  ,  n.  The  arrays  P;  and 
W;  are  sorted  in  ascending  order  of  (Pi/Wi)  and  criteria  of  fit  based  on  the  first  m  entries 
in  the  sorted  arrays,  where  m  ^  n. 

A  rapidly  computable  simulation  of  this  is  as  follows.  Let  the  “Patterson”  to  be 
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searched  be  represented  by  P,  —  5^  +  A/,  where  S;  and  A/,-  are  sequences  of  random 
numbers  with  positive  means  {S')  and  {N)  and  standard  deviations  a{S)  and  a{N). 
These  sequences  are  of  equal  length  /,  here  taken  as  500. 

The  n  search  vectors  in  this  simulation  are  a  positionally  significant  subset  of 
the  sequence  S;.  Without  loss  of  generality  the  Wj  can  be  taken  as  a  contiguous  sequence 
Wj  =  ^j+kQ  where  the  (/cq  +  l)th  entry  in  the  S,  sequence  is  the  first  in  Wj.  The  w/s  are 
treated  as  the  “known”  search  vector  weights;  n  was  taken  as  300  here.  The  search  is 
carried  out  by  translating  the  Wj  sequence  along  the  P;  sequence,  allowing  the  vv^’s  to 
sample  Pj^^  for  successive  values  of  k,  from  zero  tol  —  n.  At  each  of  the  201  steps  in  the 
search  the  data  are  sorted,  and  several  criteria  of  fit  evaluated.  A  given  criterion  is  judged 
successful  if  it  assumes  a  higher  value  when  /c  =  /cq  than  for  any  of  the  200  other  values 
of  k.  The  percentage  of  successes  scored  by  a  given  criterion  in  a  large  number  of 
independent  searches  allows  us  to  compare  different  criteria  with  one  another. 

Five  different  criteria  of  fit  were  evaluated.  These  included  the  minimum  average, 
MIN,  as  defined  in  eq.  (1),  a  weighted  minimum  average 

m  I  m 

wmin(wt,  n)  =  ^  (WfP,)  /  X  m  ^  n  (3) 

1=1  /  i  =  i 

and  the  quantity 

m 

Msp(wi,  n)  =  X  (WjP,)  m  ^  n  (4) 

i=  1 

which  is  a  generalization  of  SP  (equation  (2)).  The  criteria  i  (1  +  w,/<w»Pi/ 
Z  1 W;  and  Z  ”=  j  ( P,/w,),  m  ^  n,  were  also  computed.  They  were  less  successful  than 
the  others,  and  are  not  further  discussed. 

Figure  1  shows  the  results  for  the  three  best  criteria  of  fit.  In  these  calculations  the 
noise.  A/,,  was  taken  as  Gaussian  with  a  mean  of  300  and  a  standard  deviation 
(t(N)  =  100.  The  sequence  5;,  including  the  vector  weights  w,  was  also  Gaussian  with  a 
mean  ==  3o-(5),  and  a  (5)  ranging  from  15to  50.  For  each  choice  ofcr(S'),  at  least  100  runs 
were  calculated. 

The  results  are  summarized  in  figure  1.  It  should  be  noted  that  the  ‘noise  level’  of  the 
ordinate  is  1/201  or  0-5  %;  this  would  be  the  statistical  chance  of  a  ‘successful’  search 
with  a  vanishingly  small  cr(-S).  At  the  upper  end,  (t(S')  »  0-5a{N)  essentially  insures 
success. 

All  curves  reach  maxima  at  values  of  m  <  n  —  300.  The  position  of  the  maximum 
appears  to  be  independent  of  a{S)/o{N)  for  any  one  criterion  of  fit. 

With  m  =  300,  the  min  criterion  (equation  (1))  reduces  to  Zf^'j  P;  divided  by  a 
constant.  Since  the  calculation  is  designed  so  that  <w>  ^  <S>,  the  min  criterion  is 
meaningless  at  m  —  300,  and  the  corresponding  points  are  not  shown. 

The  optimal  choice  of  m  for  min  can  be  estimated  to  be  approximately  0-3  n,  in 
agreement  with  past  experience.  The  wmin  criterion  tends  to  have  its  maximum  at 
(0-4-0-5)n,  and  in  essentially  every  run  tends  to  give  slightly  higher  success  rates  than 
MIN.  The  maxima  of  msp  tie  at  approximately  0-7  n,  and  tend  to  be  lower  than  those  of 
either  wmin  or  min.  It  should  be  noted  that  at  m  =  300  msp  is  identical  to  sp 
(equation  (2)).  This  criterion  is  distinctly  inferior  to  any  of  the  others. 

Additional  runs  were  done  with  A/;  unchanged,  but  S;  uniformly  distributed  between 
0  01  and  variable  upper  limits,  up  to  60.  When  compared  to  the  Gaussian  runs  with 
equivalent  (t(5)  no  clear  difference  in  success  rates  could  be  discerned. 
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Figure  1 .  Percentage  of  successful  structure  searches  as  a  function  of  m  <  n  =  300  for  three 
criteria  of  fit  as  defined  in  the  text:  min  (left),  wmin  (centre)  and  msp  (right).  For  each  criterion 
the  percentage  is  shown  for  a(S)/a(N)  =  015  (bottom  graph),  0-2,  0-25,  0-3,  0-4  and  0-5  (top 
graph). 


Finally  the  effect  of  varying  the  mean  of  the  noise  N,  was  explored,  keeping  (t{N) 
constant  at  100.  Lowering  (N  >  to  250 caused  some  deterioration;  raising  it  to  KXX)  or 
above  also  appeared  to  cause  some  decline.  One  may  tentatively  conclude  that  the 
optimal  choice  of  the  constant  term  in  the  Fourier  synthesis  of  the  Patterson  is  3  to  5 
times  (t{P);  this  choice  also  achieves  a  reasonable  compromise  between  numerical 
accuracy  and  packing  density  in  the  computer  memory. 

3.  Simulated  symmetry  search 

In  each  simulated  symmetry  search  a  set  of  50  independent  random  number  sequences 
Fi  =  5i  +  iV,.  (=1,  ...,960 

was  searched.  The  noise  sequences  iV;  were  taken  as  Gaussian  with  <iV>  =  100  and 
a{N)  =  20.  The  sequences  S;  were  also  Gaussian  with  <5>  =  0  and  cr(5)  variably 
chosen  from  0-05  cr(N)  to  0-45  (j{N). 

One  of  the  50  S,  sequences  was  modified,  so  as  to  give  it  n^-fold  symmetry,  by 
requiring  that  =  S,,  where  j  =  960/ n^.  Thus,  this  one  S,  sequence  consists  of 
copies  of  a  random  number  sequence  of  length  j. 

For  each  of  the  50  F, -sequences  the  quantity 

i  I 

i  =  1  s  =  0 

was  evaluated.  This  quantity  is  expected  to  have  its  lowest  value  for  the  “symmetric” 
sequence.  If  this  is  indeed  found,  the  search  is  taken  as  successful.  The  “noise  level”  for 
this  search,  as  formulated  here,  is  1  /50  or  2  %. 

Symmetry  searches  were  carried  out  for  n^=  12, 6  and  2.  For  each  choice  of  a(S)  50  to 
100  searches  were  done.  The  results,  shown  in  figure  2  bear  out  the  expected  sharp 
increase  in  the  success  rate  with  increasing  strength  of  the  symmetric  component,  (7(S), 


Criteria  of  fit  in  Patterson  search 


43 


100 


0 


0.0 


0.1 


0.2 


0.3 


0.4 


0.5 


sigma(s)/sigma  (/v) 


Figure  2.  Percentage  of  successful  symmetry  searches  as  a  function  of  a{S)/cF{N)  for  12-fold 
(left  curve),  6-fold  (centre)  and  2-fold  (right)  local  non-crystallographic  symmetry.  The  size  of 
the  noncrystallographically  symmetric  region  is  the  same  in  the  three  cases. 


in  relation  to  the  noise,  (t{N).  The  three  curves  also  show  the  increase  in  detectability 
which  accompanies  an  increase  in  symmetry,  here  from  2-fold  to  6-fold  and  12-fold. 

4.  Conclusions 

The  results  unambiguously  demonstrate  that  the  discriminating  power  of  a  Patterson- 
space  structure  search  improves  when  the  “minimum  average”  principle  is  applied  in 
calculating  the  criterion  of  fit.  We  conclude  that  in  searches  where  the  detectability  of 
the  correct  solution  is  at  all  in  doubt,  the  benefits  of  calculating  minimum  averages  are 
well  worth  the  additional  computing  time  required  for  the  sort,  on  P,/Wi,  which  is 
carried  out  at  every  step  in  the  search. 

Somewhat  more  tentatively,  the  results  suggest  that  the  criterion 


with  m  K  (0-4  — 0-5)n,  is  superior  to  the  presently  used 


Both  are  consistently  superior  to  the  criterion 


n 


i=  1 


It  should  be  emphasized  that  it  has  not  been  shown  or  suggested  that  any  of  these 
criteria  is  the  best  one  that  can  be  formulated.  What  the  best  one  is,  is  still  an  open 
question. 
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The  probability  of  large  structure  amplitudes:  The  space  group  P 1 


A  J  C  WILSON 

Crystallographic  Data  Centre,  University  Chemical  Laboratory,  Cambridge  CB2  iEW, 
England 

Abstract.  Until  recently,  expressions  derived  for  the  probability  distribution  of  the  magni¬ 
tudes  of  structure  amplitudes  have  been  valid  only  for  the  range  from  zero  up  to  two  or  three 
times  the  average  magnitude.  Recent  progress  in  obtaining  results  valid  for  larger  structure 
amplitudes  (up  to  unitary  structure  factors  approaching  unity)  is  reviewed;  the  methods 
employed  include  Fourier-series  representations,  steejDest-descents  approximations,  and 
computer  simulations.  Except  for  the  last,  the  results  are  valid  only  for  the  space  group  PI, 
though  those  for  the  space  groups  belonging  to  the  point  group  2/m  and  possibly  some  other 
centro-symmetric  groups  of  low  symmetry  are  expected  to  behave  similarly. 

Keywords.  Crystallographic  statistics;  large  structure  amplitudes;  probability  distribution; 
statistics. 


1.  Introduction 

The  calculation  of  the  probability  of  a  structure  amplitude  having  a  particular  value  is  a 
special  case  of  the  random-walk  problem.  If  correlations  can  be  neglected  (Wilson 
1981),  for  the  space  group  PI  the  problem  is  exactly  that  of  the  random  walk  in  two 
dimensions,  and  for  P 1  it  is  that  of  the  projection  on  a  line  of  a  random  walk  in  two 
dimensions.  Most  attention  has  been  given  to  walks  that  do  not  get  far  from  the  starting 
point — in  crystallographic  terms  structure  amplitudes  that  do  not  exceed  two  or  three 
times  their  average  magnitude — but  for  several  years  I  have  tried  unsuccessfully 
(Wilson  1980b)  to  obtain  an  estimate  of  the  probability  of  structure  amplitudes  with 
magnitudes  approaching  the  maximum  possible.  This  maximum  possible  is,  of  course, 
the  sum  of  the  atomic  scattering  factors  of  all  the  atoms  in  the  unit  cell: 

l^lmax  =  =/l  +/2  +/3  +  •  ■  •  (1) 

In  direct  methods  of  structure  determination  the  so-called  unitary  scattering  factors 
and  unitary  structure  amplitudes  are  often  used;  these  are  the  ordinary  ones  divided  by 
O,  and  are  thus  convenient  also  in  discussing  the  probability  of  large  structure 
amplitudes.  The  unitary  atomic  scattering  factor  of  the  jth  atom  is 

uj = m  (2) 

and  the  unitary  structure  amplitude  is 

U  =  P/(D.  (3) 

‘Large’  unitary  structure  amplitudes  are  thus  somewhat  less  than  unity.  Derivations  of 
the  mathematical  expressions  for  p{U)dU,  the  probability  that  a  structure  amplitude 
will  have  a  unitary  value  between  U  and  U  +  dU,  are  complicated  and  it  is  more  useful 
to  try  to  review  the  present  state  of  knowledge  than  to  work  through  any  derivation.  So 
far  I  have  been  able  to  locate  useful  analytic  results  only  for  structures  in  the  space  group 
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FI ;  it  seems  likely  that  other  low-symmetry  centrosymmetric  groups  will  behave  very 
similarly. 


2.  Exact  expression 


For  the  pure  mathematician  there  is  no  problem.  Kluyver  (1906)  was  able  to  give  p{U) 
as  an  integral; 


p{U)dU  =  {2n)-^ 


r 


n  ^(2WJX) 

_i=i 


cos{Ux)dxdU, 


(4) 


where  Jq  is  the  zero-order  Bessel  function.  Kluyver  himself  recognized,  however,  that 
this  expression  is  of  little  value  for  actual  computation  in  any  range  of  U. 


3.  Series  representations 


For  small  U,  the  exact  expression  (4)  reduces  to  the  central-limit  Gaussian  expression 
(Wilson  1949),  which  can  be  made  the  basis  of  a  series  expansion  in  terms  of  Hermite 
polynomials  (see,  for  example,  Karle  and  Hauptman  1953;  Rogers  and  Wilson  1953; 
Bertaut  1955;  Foster  and  Hargreaves  1963;  Shmueli  and  Wilson  1981,  1982).  The 
expansion,  in  either  the  Edgeworth  or  the  Gram-Charlier  form,  satisfies  the  Cramer 
criterion  for  convergence,  but  in  practice  there  may  be  unacceptable  negative  prob¬ 
abilities  in  the  region  of  present  interest. 

Barakat  (1973)  pointed  out  that  p{U)  vanishes  outside  the  range  —  1  to  -I- 1,  so  that  it 
can  be  represented  by  an  ordinary  Fourier  series.  His  actual  application  was  to  a  three- 
dimensional  random  walk  (with  polymer  chains  in  mind),  but  Weiss  and  Kiefer  (1983) 
have  modified  it  for  the  two-dimensional  case.  Their  expression  is 


piU)dU  =  ^ 


1+2  Z 


c. 


cos  nmU 


(5) 


where  the  Fourier  coefficients  C„  are  given  by 

{N 

Cm  =  n  ^(27tmMj).  (6) 

j=  1 


The  Fourier  series  has  the  great  advantage  that  the  coefficients  are  readily  obtained  for 
any  required  number  of  terms,  whereas  the  calculation  of  the  moments  required  for  the 
Hermite  coefficients  rapidly  becomes  more  tedious  as  the  order  of  the  polynomial 
increases.  For  most  space  groups  only  five-term  Hermite  expansions  are  available  at 
present,  whereas  there  is  no  definite  limit  to  the  number  that  can  be  used  in  the  Fourier 
series.  In  most  cases  tried  twenty  terms  seem  to  be  ample,  there  is  no  significant  change 
in  the  values  of  p{U)  when  the  number  is  increased  to  forty  (Shmueli  1983,  private 
communication).  A  series  simitar  to  equation  (5),  but  with  Bessel  functions  instead  of 
cosines,  can  be  developed  for  FI  (Weiss  and  Kiefer  1983,  private  communication). 


4.  Cramer’s  limit  theorem 

The  central-limit  theorem  is  not  the  only  limit  theorem,  and  results  due  to  Cramer 
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(1938)  can  be  applied  in  the  equiatomic  centrosymmetric  case  (Wilson  1983).  The  result 
is 

P{U)dU  =  {loih)}^^  exp{-^NhU)dU,  (7) 

where  h  is  the  solution  of 


I,ih)/Ioih)=U, 

the  /’s  are  modified  Bessel  functions,  and 
(7^  =  l-U/h-U\ 

For  small  U  equation  (7)  reduces  to  the  familiar  Gaussian,  and  for  large  U  to 


p(U)dU  = 


\iN 


{l-UY^-^dU. 


(8) 

(9) 

(10) 


5.  Steepest  descents 

Weiss  and  Kiefer  (1983)  use  a  more  general  approach,  the  method  of  steepest  descents 
(Daniels  1954),  and  obtain  equivalent  expressions  for  the  unequal-atom  case: 

p{U)dU  =  n  /o(N«,/i)|exp(-iiVljl/)d(7,  (11) 

where  h  is  the  solution  of  the  equation 

jN 

X  2ujI,iNujh)/Io{Nujh)=U,  (12) 

j=i 

and 

jv 

a^^lN  Y.  u/{l -I,iNujh)/NujhIoiNujh)-[IANujh)/Io{Nujh)y}  (13) 

j  =  i 

In 

=  2N  Y  uy-U/h-2N  Y  [ujIi{Nujh)/IoiNujh)y.  (14) 

7=1  7=1 

Equations  (11)-(14)  reduce  in  an  obvious  fashion  to  (7)-(9)  in  the  equal-atom  case,  in 
which  all  Uj  =  N~K 


6.  Resultants  near  maximum  possible 


An  unequal-atom  analogue  of  (10)  is  readily  obtained  from  (11),  but  Weiss  and  Kiefer 
(1983)  adopt  a  different  approach  that  leads  to  a  better  result.  In  effect,  they  expand  the 
Bessel  functions  in  the  exact  expression  (4)  into  asymptotic  series,  multiply  out,  and 
integrate  term  by  term.  The  result  is  a  series  in  ascending  powers  of  (1  —  C/),  of  which  the 
first  term  is 


p(U)dU  = 


(2n)*"  rttN)  n  (2uj)i 


7=1 


(l-t/)iiv-ldt/. 


(15) 
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Weiss  and  Kiefer  give  two  more  terms  explicitly  and  recurrence  relations  for  higher 
terms. 

Wilson  (unpublished)  has  developed  a  calculation  that  suggests  that  p{U)  will  behave 
like  a  power  of  (1  —  (7)  for  large  U  for  any  centrosymmetric  space  group. 

7.  Simulation 

Shmueli  (1982)  has  described  a  method  of  computer  simulation  that  gives  an 
‘experimental’  distribution  for  any  postulated  structure,  or  rather  for  any  postulated 
composition;  the  effects  of  correlation  of  the  real  atomic  positions  are  not  taken  into 
account.  None  of  the  theoretical  distributions  described  above  take  such  correlations 
into  account,  so  a  comparison  of  the  simulated  distributions  with  the  theoretical  is 
comparing  like  with  like,  whereas  a  comparison  with  distributions  actually  observed 
for  real  crystals  involves  a  further  complication,  the  full  effect  of  which  is  unknown 
(Wilson  1981).  Briefly,  the  simulation  method  consists  in  calculating  a  suitable  number 
of  structure  amplitudes  (3000  is  typical),  with  the  usual  atomic  scattering  factors  but 
with  hx,  ky  and  Iz  or  their  combinations  replaced  by  computer-generated  ‘random’ 
numbers.  The  resulting  |  f  |’s  are  sorted  by  size  into  suitable  groups  to  give  a  histogram; 
examples  are  given  by  Shmueli  (1982,  1983).  The  method  is  applicable  to  any  space 
group. 

The  simulation  method  seems  to  be  the  easiest  and  most  flexible  method  yet  devised 
for  obtaining  ideal  distributions  for  small  and  moderate  structure  amplitudes,  but  it  has 
one  aesthetic  and  one  practical  disadvantage.  Aesthetically,  numerical  calculations  tied 
to  a  specific  atomic  composition  and  a  specific  symmetry  are  less  pleasing  than  closed  or 
series  expressions  in  which  the  effect  of  changing  atomic  composition  or  symmetry  can 
be  ‘seen’.  Practically,  because  of  the  statistical  nature  of  the  simulation,  there  are 
random  fluctuations  in  the  heights  of  the  bars  of  the  histogram.  Wilson  (1983, 
unpublished)  has  estimated  the  extent  of  these;  for  3000  simulations  and  30  histogram 
divisions  the  residual  R  between  the  actual  histogram  and  the  ideal  distributions  should 
be  about  0  07.  To  obtain  reliable  estimates  of  the  probability  of  really  large  (greater 
than  four  times  the  average,  say)  structure  amplitudes  would  require  a  very  large 
number  of  simulations. 

8.  Accuracy 

The  Barakat  Fourier  series  (5)  seems  to  reproduce  the  exact  ideal  distribution  to  several 
decimal  places,  the  limiting  factor  being  probably  the  approximations  used  in  the 
computer  calculation  of  the  Bessel  functions;  at  present  its  use  is  limited  to  the  space 
group  P\.  Weiss  and  Kiefer  (1983  and  private  communications)  and  Shmueli  (1982 
and  private  communications)  have  made  various  numerical  comparisons  of  the 
Hermite-Gaussian  and  steepest-descents  expressions  with  the  Fourier  results  for 
various  total  numbers  of  atoms  (10  to  30)  and  various  ratios  of  atomic  scattering 
factors.  As  would  be  expected,  the  non-Fourier  analytic  expressions  work  best  for 
equal-atom  structures,  and  are  poor  if  there  is  one  pair  of  atoms  much  heavier  than  the 
rest,  when  the  distribution  becomes  bimodal.  The  expression  (15)  is  very  good  for  large 
U.  For  smaller  U,  the  steepest-descents  results  are  usually,  but  not  always,  better  than 
the  Gaussian-Hermite.  Simulation  results  agree  with  the  Fourier  within  the  expected  R. 
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The  preceding  remarks  refer  to  ideal  distributions  and  different  methods  of 
approximating  them.  The  distributions  observed  experimentally  for  real  crystals  will 
show  statistical  fluctuations  of  the  same  type  as  in  those  obtained  by  simulation.  The 
size  of  the  fluctuations  from  this  source  (there  are  several  other  sources)  is  inversely 
proportional  to  the  square  root  of  the  number  of  reflexions  observed,  but  3000  is  fairly 
typical.  Only  differences  between  an  observed  and  a  calculated  histogram  appreciably 
exceeding  an  R  of  0  07,  or  between  an  observed  and  a  simulated  histogram  appreciably 
exceeding  OT,  could,  with  reasonable  certainty,  be  attributed  to  non-statistical  factors 
(Wilson  1980a).  It  is  hoped  to  include  a  discussion  of  tests  of  the  statistical  significance 
of  apparent  differences  between  two  distributions,  either  one  observed  and  one 
calculated,  or  two  calculated,  in  a  forthcoming  paper  (Shmueli  et  al  1983). 
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Maximum  entropy — A  new  approach  to  the  crystallographic  phase 
problem 


RAMESH  NARAYAN*,  RAJARAM  NITYANANDA  and  G  V  VANI 

Raman  Research  Institute,  Bangalore  560080,  India 

*  Present  address:  Theoretical  Astrophysics,  California  Institute  of  Technology,  Pasadena, 
California  91125,  USA 

Abstract.  We  have  recently  drawn  the  attention  of  crystallographers  to  the  maximum  entropy 
method  (mem)  of  image  reconstruction  which  has  found  application  in  radio  astronomy  and 
geophysics.  In  this  article,  we  review  the  theory  of  the  mem  as  applied  to  the  crystallographic 
phase  problem  and  discuss  points  of  similarity,  and  differences,  with  conventional  techniques. 
We  then  present  a  number  of  illustrative  calculations  in  two  and  three  dimensions  on  simulated 
and  real  structures.  These  examples  show  that  the  mem  can  be  used  for  phase  refinement  from 
partially  known  phases,  say  in  macromolecular  crystallography,  and  also  for  solving  crystal 
structures  ab  initio  using  only  measured  Fourier  amplitudes.  With  improved  algorithms  the 
method  promises  to  become  competitive  with  current  techniques  in  crystal  structure  analysis. 

Keywords.  Crystallography;  crystal  structure  analysis;  phase  problem;  maximum  entropy 
method;  direct  methods. 


1.  Introduction 

In  conventional  x-ray  crystal  structure  analysis  one  measures  the  intensities  of  Bragg 
reflections  of  the  crystal  under  study.  The  intensity  /h,  corresponding  to  the  reciprocal 
vector  is  related  to  the  complex  Fourier  structure  factor  Fh^  by 


(1) 


where  the  structure  factors  Fh.  and  the  electron  density  p(r)  in  the  crystal  unit  cell  are 
related  by  a  Fourier  transform: 


Fhj  =  I  Fhj  I  exp  (itpHj )  =  p(r)  exp  {2ni H,-  ■  r)  d  F, 


(2) 


P('')  =  77  Z  Fhj  exp  ( -  27riH^  •  r) 

F  r  H  . 


(3) 


is  the  volume  of  the  crystal  unit  cell.  Thus,  one  experimentally  measures  only  the 
structure  amplitudes  |FhJ  whereas  according  to  (2)  and  (3)  one  requires  both 
amplitudes  as  well  as  phases  cph.  to  compute  the  electron  density.  This  is  the  famous 
phase  problem  of  crystallography — one  cannot  solve  the  structure  unless  one  has  some 
estimates  of  the  phases.  The  magnitude  of  the  problem  has  been  brought  out  strikingly 
by  Ramachandran  and  Srinivasan  (1970)  who  have  given  examples  of  how  true  phases, 
coupled  even  with  random  amplitudes,  can  bring  out  the  true  crystal  structure;  on  the 
other  hand,  true  amplitudes  with  random  phases  give  no  information  at  all. 

Obviously  one  needs  to  put  in  some  a  priori  information  in  addition  to  the  measured 
amplitudes  if  one  is  to  obtain  a  solution.  The  most  powerful  input  turns  out  to  be  the 
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prior  knowledge  that  the  crystal  consists  of  atoms  so  that  the  (positive)  electron  density 
is  strongly  clumped  in  well  separated  peaks.  Further,  one  generally  knows  in  advance 
the  number  and  types  of  atoms  present  and  also  has  limited  stereochemical 
information.  Although  a  rigorous  proof  is  lacking,  all  these  extra  constraints  in  practice 
seem  to  ensure  a  unique  solution  to  the  phase  problem,  apart  from  trivial  ambiguities 
related  to  the  choice  of  origin  or  enantiomorph.  This  solution  is  however  not  easy  to 
determine! 

For  small  structures  involving  a  few  atoms,  Patterson  techniques  work  quite  well. 
Here  one  computes  the  Fourier  transform  of  the  intensities  /hj  ( =  |  Fh,  |^)-  This  map  can 
be  shown  to  be  the  self-convolution  of  the  electron  density  {e.g.  Ramachandran  and 
Srinivasan  1970)  and  so  each  peak  in  it  corresponds  to  an  interatomic  vector  in  the 
original  structure.  If  the  peaks  are  well  resolved,  which  can  happen  in  small  structures, 
one  “inverts”  this  information  on  interatomic  bonds  to  solve  the  structure. 

Even  larger  structures  can  be  solved  using  the  Patterson  technique  provided  a  few  of 
the  atoms  are  significantly  heavier  {i.e.  have  more  electrons)  than  the  rest;  one  can 
obtain  the  heavy  atom  positions  from  the  Patterson  map  and  solve  the  rest  of  the  structure 
by  using  the  heavy  atom  derived  phases  for  the  structure  factors.  By  far  the  most 
challenging  aspect  of  the  phase  problem  in  crystallography  has  been  the  solution  of 
large  structures  of  nearly  equal  atoms.  Here  some  very  powerful  techniques  have  been 
developed  over  the  years  which  go  collectively  under  the  name  of  direct  methods  (see 
for  example  Ladd  and  Palmer  1980).  In  a  pioneering  paper  Sayre  (1952)  mathematically 
expressed  the  equality  of  atoms  by  the  statement  that  the  squared  structure  p^(r)  is  very 
similar  to  p(r)  except  for  a  “normalisation”.  He  showed  that  this  immediately  leads  to 
relations  among  the  phases  of  certain  sets  of  reflections — the  famous  triplet  phase 
relation.  Later  work  has  extended  this  approach  to  higher  order  relations  and 
probability  distributions  have  been  worked  out  for  the  values  of  sums  of  specific  groups 
of  phases  (the  so-called  structure  invariants  and  seminvariants).  Coupled  with 
multisolution  (Karle  and  Karle  1963,  1966;  Germain  et  al  1971)  methods  (where  one 
selects  a  small  set  of  primary  reflections,  systematically  assigns  various  phase  values  to 
these  and  in  each  case  solves  the  phases  of  the  other  reflections  using  triplet  and  other 
relations),  it  can  be  said  that  the  direct  methods,  exemplified  by  the  computer  program 
MULTAN  and  its  descendants,  are  the  most  powerful  techniques  available  today  to  solve 
crystal  structures.  However,  despite  the  enviable  success  that  these  techniques  have  had, 
it  is  probably  fair  to  say  that  the  last  word  has  by  no  means  been  said  on  the  phase 
problem.  The  search  for  newer  approaches  must  go  on  as  crystallographers  attempt  to 
solve  progressively  larger  structures. 

A  technique  called  the  maximum  entropy  method  (mem  ),  which  has  been  studied  for 
some  time  in  other  fields,  has  in  the  last  year  or  two  attracted  some  interest  in 
crystallography.  Two  of  us  (rn  and  rn  )  studied  the  mem  in  the  context  of  its  application 
in  radio  astronomical  image  processing  and  drew  attention  to  its  potential  importance 
in  crystallography  (Narayan  and  Nityananda  1981,  1982).  In  this  article  we  briefly 
review  some  of  the  work  already  reported  in  the  field  and  then  describe  our  more  recent 
results  on  simulated  and  real  crystal  structures  in  two  and  three  dimensions.  On  the 
basis  of  our  present  studies  we  believe  that  the  mem  has  a  promising  role  to  play  in 
crystallography  and  merits  further  investigation.  Readers  interested  in  the  actual 
results  of  this  method  applied  to  crystallography  could  proceed  straightaway  to  §§5 
and  6,  although  we  hope  that  this  will  generate  enough  interest  in  the  mem  to  return  to 
the  more  theoretical  sections. 
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2.  Maximum  entropy  method 


Let  us  suppose,  as  is  the  case  in  many  geophysical  and  radio  astronomical  applications, 
that  Fhj  is  available  both  in  amplitude  and  phase  at  some  of  the  but  is  unknown  at 
the  rest  of  the  points.  Setting  the  unknown  Fh.  to  zero  leads  to  a  map  with  large 
termination  ripple  as  well  as  loss  of  resolution.  In  the  mem  ,  the  unknown  Fh  .  are  chosen 
so  as  to  maximise  the  “entropy”  S  given  by 


S  - 


(4) 


where /is  some  suitable  real  function  of  p(r).  There  is  an  unresolved  debate  going  on  in 
the  literature  as  to  the  most  suitable  form  for  /  One  school  of  thought  including  the 
originator  of  the  mem  (Burg  1967;  Abies  1972)  favours  the  entropy 


Si 


In  [p(r)]dF„ 


(5) 


while  others  (Frieden  1972;  Gull  and  Daneill  1978)  prefer 


S2 


p(r)ln  [p(r)]dF,. 


(6) 


The  arguments  in  favour  of  one  or  the  other  form  are  usually  based  on  information 
theory  or  combinatorial  probabilistic  arguments,  verging  sometimes  on  the  metaphysi¬ 
cal.  It  has  been  claimed  that  the  mem  produces  as  featureless  a  map  as  possible 
consistent  with  the  data  (Ponsonby  1973)  and  that  it  is  maximally  non-committal  with 
regard  to  the  unmeasured  data  (Abies  1972).  In  an  interesting  paper  Komesaroff  and 
Lerche  (1978)  (see  also  Komesaroff  et  al  1981)  showed  that  there  is  an  intimate 
connection  in  one-dimensional  problems  between  the  entropy  Sj  and  the  positivity 
constraint  on  p(r);  however,  this  does  not  generalize  very  well  to  higher  dimensions. 

In  contrast  to  the  above  approaches  which  seek  to  attribute  a  deep  foundation  to  the 
MEM ,  there  has  been  another  movement  initiated  by  Hogbom  (1978)  and  Subrahmanya 
(1978, 1980)  who  suggested  that  the  successes  of  the  method  are  primarily  due  to  the  a 
priori  information  built  in,  in  particular  the  penalty  against  negative  values.  We  have 
extended  this  view-point  (Nityananda  and  Narayan  1982;  Narayan  and  Nityananda 
1983)  and  carried  out  a  detailed  study  of  the  mem  focussing  primarily  on  the 
maximisation  conditions  that  the  reconstruction  satisfies.  By  a  simple  geometrical 
argument  it  was  shown  that  any  “entropy”  function / which  satisfies/"  <  0  and /"'  >  0 
implicitly  makes  the  a  priori  assumption  that  the  map  consists  of  isolated  sharp  features 
separated  by  flat  extended  regions.  The  forms  (5)  and  (6)  as  well  as  several  others,  e.g. 
p^'^,  —  1/p,  etc.,  satisfy  these  conditions,  and  hence  all  these  are  suitable  “entropies”  to 
generate  satisfactory  maps.  In  our  view,  the  mem  is  just  a  variational  way  of 
incorporating  our  a  priori  information  that  the  true  map  consists  of  peaks  on  a  flat 
background.  The  relevance  of  the  mem  to  crystallography  then  becomes  obvious. 

Regardless  of  the  controversy  over  the  foundations  of  the  method,  it  is  well  known 
that  the  mem  produces  excellent  reconstructions.  Figure  1  shows  a  two-dimensional 
simulation  of  a  radio  astronomical  aperture  synthesis  reconstruction  where  the  mem 
map  is  clearly  far  superior  to  the  naive  approach  of  setting  unmeasured  F^^,  to  zero. 
Results  such  as  this,  coupled  with  our  conviction  that  the  a  priori  assumptions  built  into 
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Figure  1.  (a)  Simulated  “true”  map  p(x,  y)  of  a 
radio  astronomical  source  in  the  sky.  (b)  Map 
reconstructed  using  only  “measured”  Fourier  data 
F(h,  k)  corresponding  to  reciprocal  vectors  with 
\h\,  |fc|  ^  3.  All  the  other  F{h,k)  have  been  set 
equal  to  zero,  (c)  Reconstruction  from  the  same 
data  using  the  Sj  form  of  the  MEM  where  the 
unmeasured  Fourier  coefficients  are  optimized  to 
maximise  the  integral  of  -  p  In  p.  This  reconstruc¬ 
tion  was  obtained  by  setting  the  non-linearity 
parameter  R  (defined  in  §3)  to  the  value  100.  Full 
contours  correspond  to  positive  values,  the  suc¬ 
cessive  levels  being  at  0  01,  0  03,  0-05,  OT,  0-2,  0-3, 
0'5,  0’7  and  1.  Negative  contours,  in  dashed  lines, 
are  at  -001,  -003  and  -005.  (From 
Nityananda  and  Narayan  1982). 


the  MEM  are  perfectly  suited  to  crystallography,  originally  led  us  to  propose  that  the 
method  could  help  solve  the  phase  problem  (Narayan  and  Nityananda  1981,  1982). 


3.  Maximum  entropy  for  the  phase  problem 


For  the  phase  problem  we  write  (4)  in  the  form 


S  = 


ZI^Hjexp{i((pH.-27tH-r)} 

Hj 


where,  because  p(r)  is  real,  we  have 


(7) 


(8) 


Maximum  entropy  approach  to  crystallography 


55 


Let  us  define  the  Fourier  coefficients  G^^,  as  follows 


l^Hjl  ~ 


/'[p(r)]exp  (27riH^T)dF,. 


(9) 


Then,  maximising  the  entropy  S  in  (7)  with  respect  to  the  unknown  phases  we  obtain 
(Narayan  and  Nityananda  1981,  1982) 


I^hJ  ~ 


(10) 


Equations  (10)  (there  are  as  many  equations  as  the  number  of  unknown  phases)  have  to 
be  satisfied  in  order  to  maximise  the  “entropy”.  Since  |Gh  |  and  6^^  are  implicitly 
functions  of  all  the  these  are  a  highly  coupled  system  of  nonlinear  equations  which 
will  have  to  be  solved  numerically  by  an  iterative  procedure.  However,  we  note  that 
dS  I d(pj^  are  the  components  of  the  gradient  of  S  with  respect  to  the  unknown  and 
the  calculation  of  |Gh  |  and  involves  only  Fourier  transforms  which  can  be  very 
efficiently  carried  out  by  means  of  the  fast  Fourier  transform  algorithm.  Thus 
gradient-type  algorithms  to  maximise  S  (see  §  4)  are  easily  developed  and  these  are  not 
likely  to  be  very  expensive  in  terms  of  computing  time. 

A  crucial  issue  is  the  question  of  the  uniqueness  of  the  solution.  Of  course,  for  every 
solution  there  is  a  whole  family  related  to  it  by  an  origin  shift  and/or  enantiomorph 
reversal.  However,  apart  from  this  trivial  ambiguity,  it  is  possible  that  there  are  several 
distinct  maxima  representing  physically  different  solutions.  To  see  this  let  us  write  the 
conditions  for  the  vanishing  of  the  gradient  in  (10)  as 


where  the  n^^aTe  integers  which  have  either  of  the  two  values  0  or  1.  One  can  attempt  to 
solve  (10)  for  each  choice  of  the  n^^  (though  it  is  possible  that  for  certain  choices  there 
will  be  no  solution  at  all).  In  addition,  for  each  choice  of  the  there  may  be  more  than 
one  distinct  solution.  Presumably,  the  majority  of  such  solutions  correspond  to  saddle 
points  in  the  entropy  and  only  a  small  fraction  constitute  true  maxima.  A  function  of  n 
phase-like  variables  can  be  easily  seen  to  have  at  least  one  maximum,  one  minimum,  and 
n  saddle  points,  using  the  arguments  illustrated  for  n  =  2  in  the  context  of  solid  state 
physics  by  Ziman  (1972).  In  any  case,  it  is  clear  that  the  phase  problem  introduces  a  rich 
new  dimension  to  maximum  entropy.  The  existence  of  several  local  maxima  (about 
which  we  are  convinced  on  the  basis  of  our  computer  simulations)  means  that  any 
numerical  technique  is  liable  to  be  trapped  in  the  wrong  maximum.  The  saving  grace  is 
that  these  subsidiary  maxima  generally  display  elements  of  the  true  structure  and  are 
therefore  still  useful  for  the  elucidation  of  the  structure. 

Even  if  there  are  multiple  mem  solutions  to  the  phase  problem,  it  is  crucial  that  the 
true  structure  itself  should  be  close  to  a  solution — only  then  can  we  hope  to  get 
something  out  of  the  method.  At  a  general  level,  since  the  electron  density  distribution 
in  a  crystal  is  always  peaky  with  a  flat  baseline,  it  satisfies  the  a  priori  input  built  into  the 
MEM  (§2)  and  hence  is  likely  to  be  a  solution.  However,  one  can  more  rigorously 
establish  this  for  equal  atom  structures.  Figure  2a  schematically  shows  the  electron 
density  p(x)  (in  one  dimension)  of  a  “structure”  of  equal  symmetric  well-resolved 
atoms.  Let  us  for  convenience  focus  on  the  entropy  where  /(p)  =  In  (p).  Then 

/'(P)=l/P.  (12) 
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(a)  (b) 


Figure  2.  (a)  Schematic  electron  density  p{x)  of  a  one-dimensional  structure  with  two  equal 
“atoms”;  a  constant  “DC  value”  of  0-5  has  been  added,  (b)  Variation  of  l/p(x),  showing  that  it 
is  also  a  structure  with  two  equal  atoms;  the  atoms  here  are  negative,  have  a  different  shape 
from  those  in  (a)  and  hang  from  a  DC  level  of  2. 


and  its  shape  as  a  function  of  x  for  the  true  structure  is  shown  in  figure  2b.  It  is  clear  that 

/'[PtrueW]  =  ^  +  PtrueW  *  (13) 

where  X  is  a  constant,  *  represents  a  convolution  and  S{x)  is  a  symmetric  “shape” 
function.  Taking  the  Fourier  transform  of  (13)  we  have  for  any  non-origin  Hj 

(1*^) 

where  S^,,  the  Hjth  Fourier  coefficient  of  S(x),  is  a  real  number  (positive  or  negative). 
Thus  we  automatically  have 

0hj=<Phj  or  (pH^  +  n,  (15) 

depending  on  the  sign  of  S^.-  This  is  identical  to  the  conditions  written  in  (10)  or  (1 1) 
and  shows  that  equal  atom  structures  are  automatically  solutions  of  the  mem.  As  a 
further  bonus,  since  the  shape  function  S(x)  is  usually  known,  one  has  full  information 
on  Sn,  and  hence  one  can,  in  principle,  determine  all  the  in  (1 1).  This  greatly  reduces 
the  number  of  solutions  to  the  problem.  It  is  clear  that  the  above  considerations  work 
for  all  forms  of  the  entropy  and  are  dimension-independent. 

In  the  case  of  non-equal  atom  {i.e.  heavy  atom)  structures,  there  is  a  rather  subtle 
point  involved.  It  will  be  noticed  that  both  the  entropies  and  Sj  ((5)  and  (6))  are 
defined  only  for  positive  values  of  the  argument.  Hence,  at  any  intermediate  stage  of 
refinement,  when  one  has  a  lot  of  spurious  features,  in  the  map,  including  negative 
excursions  of  the  electron  density,  it  is  necessary  to  add  a  suitable  constant  C  to  p(r). 
Any  value  of  C  larger  than  the  greatest  negative  value  of  p(r)  is  enough  to  make  the 
problem  well-defined.  However,  it  is  clear  that  the  reconstructed  solutions  now  have  a 
non-trivial  dependence  on  the  value  of  C.  Rather  than  look  at  C  itself,  in  our  work  we 
have  found  it  useful  to  employ  another  parameter  R  given  by 

«=/"((>m,.  +  C)//"(P™,  +  C).  (16) 

We  have  borrowed  this  concept  from  our  earlier  work  in  radio  astronomy  (Nityananda 
and  Narayan  1982)  where  we  showed  that  R  is  a  measure  of  non-linearity.  For  the 
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present  purposes  it  is  sufficient  to  note  that  large  values  of  C  imply  small  values  of  R 
(however,  R  is  always  >  1)  while  the  smallest  permitted  C  {viz.  —  gives  i?  ->  oo.  It  is 
also  clear  that  as  1,/"  is  constant  and  /'  a  linear  function  of  p.  The  phases  of 
in  (9)  are  then  just  (p^^,OT  (py^  +n  and  all  maps  satisfy  the  mem  conditions  (1 1).  Now,  in 
our  experience,  low  values  of  R  lead  to  a  stable  centrosymmetric  solution  even  when  the 
true  structure  is  non-centric.  When  R  is  increased,  the  symmetry  is  broken  and  one  of 
the  two  enantiomorphs  grows  at  the  expense  of  the  other.  This  is  most  clearly  seen  in 
the  case  of  a  structure  with  a  single  heavy  atom,  where  the  enantiomorphs  are  related  by 
a  centre  of  symmetry  at  the  heavy  atom  location  (see  figure  3a).  As  R  is  increased  in 
magnitude,  the  ghost  peaks  in  the  “wrong”  enantiomorph  become  progressively  weaker 
(figure  3b).  This  suggests  that  mem  refinement  on  non-centric  structures,  particularly  if 
they  have  heavy  atoms,  should  be  carried  out  at  high  values  of  R,  i.e.  low  values  of  C. 
The  role  of  the  constant  C  vis  a  vis  centric  and  non-centric  structures  is  discussed  in 
terms  of  a  simple  illustrative  example  in  the  Appendix. 

Although  the  mem  is  a  variational  method,  having  its  origins  in  information  theory,  it 
turns  out  surprisingly  to  have  certain  features  in  common  with  more  conventional 
techniques.  If  we  look  at  the  mem  conditions  (10)  or  equivalently  (11),  we  see  that  the 
phases  0^  and  (p^l  ,arc  related.  Since  9^^  ,,by  (9),  is  a  function  of  all  the  through  p(r), 
we  thus  see  that  tfie  phase  relations  (11)  implicitly  involve  all  the  unknown  phases.  In 
conventional  crystallography  Sayre  (1952)  introduced  the  concept  of  triplet  phase 
relations,  where  the  phases  of  three  suitably  selected  reflections  are  tied  together.  This 


Figure  3.  (a)  The  true  structure  consists  of  12  atoms  shown  by  solid  lines,  with  a  heavy 
atom  located  at  the  strongest  peak.  When  the  true  phases  are  refined  by  the  mem  using  the 
entropy  form  S,  at  /?  =  9,  the  final  map  has  reasonably  strong  ghost  peaks  as  shown  by  the 
dashed  lines.  These  peaks  correspond  to  a  spurious  enantiomorph  molecule  sharing  the  same 
heavy  atom  as  the  true  structure,  (b)  When  the  refinement  is  carried  out  at  K  =  100,  the  ghost 
peaks  are  very  much  weaker  showing  that  the  mem  breaks  the  symmetry  more  effectively.  On 
the  other  hand,  if  one  used  R  4:9,  one  would  obtain  a  completely  centrosymmetric  map  even 
though  the  structure  amplitudes  correspond  to  a  non-centric  structure. 
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has  since  been  extended  to  quartets,  quintets  etc.,  which  are  believed  to  be  more 
powerful,  though  also  more  difficult  to  implement  (Ladd  and  Palmer  1980).  Since  the 
MEM  involves  very  high  order  phase  relations  which  are  relatively  easy  to  implement 
through  Fourier  transforms  and  the  gradient  method,  there  is  a  case  for  believing  that 
the  method  may  be  an  improvement  over  current  direct  methods.  One  can  look  upon 
the  “non-linearity”  parameter  R  as  determining  the  extent  to  which  different  phases  are 
coupled  by  the  mem  condition,  since  they  become  decoupled  in  the  P  ->  1  limit.  Since  R 
can  be  controlled  through  the  constant  C,  this  may  be  considered  an  added  advantage. 

There  has  been  interest  in  recent  times  in  a  technique  called  the  maximum 
determinant  method  (Lajzerowicz  and  Lajzerowicz  1966;  Tsoucaris  1970, 1980)  which 
is  based  on  the  Karle-Hauptmann  (1950)  determinant  and  involves  high  order  phase 
relations.  We  have  proved  (Narayan  and  Nityananda  1982)  that  the  maximum 
determinant  method  is  closely  related  to  maximising  the  entropy  Sj  (Britten  and 
Collins  1982  have  also  independently  reached  the  same  conclusion);  this  is  therefore 
another  contact  between  conventional  crystallography  and  the  mem. 


4.  Numerical  algorithm 


We  have  carried  out  a  series  of  calculations  on  model  “structures”  in  two  and  three 
dimensions  as  well  as  on  a  few  real  structures  in  three  dimensions.  We  have  employed 
the  gradient  approach  to  maximisation  of  S,  using  the  expression  (10)  for  the 
components  of  the  gradient.  The  alternative  fixed-point  approach  which  seeks  to 
directly  achieve  the  conditions  (11)  could  be  faster  when  properly  implemented. 
However,  fixed  point  schemes  are  often  notoriously  unstable;  therefore  this  approach 
seemed  unsafe  to  us  at  this  early  stage  of  investigation  when  speed  of  convergence  is  of 
secondary  importance  compared  to  the  need  to  understand  the  properties  of  the 
method.  The  gradient  method  on  the  other  hand  is  very  stable  and  guarantees  that  S 
increases  with  each  iteration. 

All  our  studies  have  been  in  the  lowest  symmetry  viz  pi  (in  2D)  and  FI  (in  3D).  This 
was  dictated  by  two  considerations.  Firstly,  conventional  methods  work  very  well  for 
higher  symmetry  crystals,  particularly  when  they  are  centric.  It  seemed  that  the  mem 
could  hope  to  make  the  greatest  contribution  in  low  symmetry  non-centric  structures. 
Secondly,  the  gradient  algorithm  requires  continuous  variation  of  phases  whereas 
centric  structures  have  only  two  values  for  each  phase  viz  0  or  n. 

In  our  work  with  the  gradient  method,  we  often  noticed  that  a  single  peak  in  the  map 
grew  at  the  expense  of  the  others,  thus  making  it  virtually  impossible  to  interpret  the 
map.  We  have  been  able  to  suppress  this  tendency  by  maximising  a  modified  form  of 
“entropy”  viz 


S'  = 


{/[Ci+p(r)]-E/[C2-p(r)]}dF„ 


(17) 


where  Cj  is  a  constant  (>  —  Pniin)  which  is  adjusted  to  give  the  required  value  of  R 
(equation  (16))  and  Cj  is  a  second  constant  set  to  have  a  value  slightly  greater  than  the 
theoretical  height  of  the  heaviest  atom  (see  figure  4).  The  constant  C2  ensures  that  any 
peak  which  attains  the  expected  height  of  an  atom  does  not  grow  any  further,  thus 
giving  a  chance  for  the  other  atoms  to  develop.  The  constant  Ci  is  regularly  updated  at 
each  cycle  in  order  to  maintain  the  value  of  the  nonlinearity  parameter  R.  In  the  early 


Maximum  entropy  approach  to  crystallography 


59 


Figure  4.  Schematic  diagram  showing  the  essentials  of  our  modified  entropy  ( 1 7),  which  has 
a  variable  “floor”,  adjusted  to  maintain  a  preset  value  of  R,  and  a  fixed  “roof’,  set  so  as  to 
discourage  any  peak  from  growing  beyond  the  theoretically  expected  height  of  an  atom. 


stages  of  ab  initio  structure  determination,  we  usually  start  with  totally  random  phases 
for  the  reflections.  The  corresponding  p{r)  would  have  no  distinct  peaks  and  would 
have  relatively  large  negative  excursions.  At  this  stage  Cj  will  have  a  large  value.  As  the 
refinement  proceeds,  peaks  will  start  developing  and  at  the  same  time  the  ripple  at  the 
bottom  would  get  ironed  out,  resulting  in  smaller  values  for  C^. 

Each  cycle  of  gradient  refinement  involves  the  following  calculations.  Using  the 
current  phases  we  compute  p{r)  by  means  of  a  Fourier  transform,  compute  the 
function  +p(r)]  — /'[U2  —  p(r)] }  and  calculate  G^,  its  Fourier  transform. 

Equations  (10)  then  give  the  components  of  the  gradient  of  S'.  Now  we  shift  the  by 
small  amounts  proportional  to  the  gradient  components  and  compute  the  new  gradient 
at  the  shifted  phases.  We  then  calculate  and  apply  the  shift  necessary  in  order  to 
minimise  the  magnitude  of  the  gradient  at  the  final  point  (one  could  also  use  the  more 
standard  scheme  of  requiring  the  final  gradient  to  be  orthogonal  to  the  starting  one). 
Thus,  one  cycle  involves  four  Fourier  transforms. 


5.  Two-dimensional  simulations* 

We  have  carried  out  a  number  of  tests  of  the  mem  on  model  two-dimensional 
“structures”  with  typically  10-15  equal  “atoms”  in  the  unit  cell.  The  electron  density 
was  sampled  on  a  32  x  32  grid  and  correspondingly  the  structure  factors  ranged  over 
reciprocal  indices  going  from  — 15  to  +16(— 16  being  identified  with  -b  16  because  of 
the  translation  symmetry  in  reciprocal  space  introduced  by  the  discrete  sampling  in  real 
space). 

Figure  5  shows  the  result  of  a  typical  refinement  from  partial  phase  information.  The 
structure  factors  of  the  model  structure  with  15  equal  atoms  were  calculated  and  their 
phases  were  given  random  errors,  with  a  rectangular  distribution,  of  rms  70°.  The 


*  The  results  described  here  were  presented  at  the  XIII  National  Seminar  on  Crystallography,  Nagpur,  India, 
March  1982. 
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Figure  5.  (a)  The  true  structure  consists  of  12  equal  atoms  at  the  vertices  joined  by  the 
straight  lines.  The  map  shown  here  is  obtained  by  adding  phase  errors  of  rms  70'’  to  all  the 
reflections,  (b)  Map  obtained  after  refining  the  phases  using  the  mem. 


corresponding  electron  density  map  is  shown  in  figure  5a.  The  structure  is  still  clearly 
seen  but  there  are  many  spurious  low-level  features  in  the  map.  Figure  5b  shows  the 
remarkable  improvement  that  can  be  achieved  by  means  of  mem  refinement.  Here  we 
did  20  cycles  of  gradient  refinement  with  the  entropy  Si  modified  as  in  (17),  using  R 
=  10.  The  final  rms  phase  error  is  only  21-7°.  Actually,  most  of  the  improvement  takes 
place  in  the  first  few  cycles.  For  instance,  after  10  cycles,  the  rms  is  as  low  as  30-8°.  From 
a  number  of  similar  studies  we  find  that  the  structure  refines  well  from  initial  phase 
errors  of  up  to  70-75°.  There  are  many  situations,  particularly  in  macromolecular 
crystallography,  where  one  can  obtain  limited  phase  information  using  techniques  such 
as  isomorphous  replacement  and  anomalous  dispersion.  There  appears  to  be  good 
hope  that  the  mem  can  greatly  help  in  phase  refinement  in  such  cases.  Collins  (1982)  has 
published  some  results  on  phase  refinement  on  real  protein  data  using  the  mem.  While 
there  is  some  improvement,  his  results  are  not  as  impressive  as  those  in  figure  5.  This 
could  be  because  of  experimental  errors  in  the  structure  amplitudes  or,  more  probably, 
because  the  atoms  in  his  structure  are  not  well  resolved  (the  mem,  being  peak-seeking,  is 
likely  to  work  best  with  high  resolution  data). 

The  more  difficult,  and  therefore  more  interesting  problem  is  to  solve  a  crystal 
structure  ab  initio,  starting  from  pure  amplitudes  and  no  phase  information.  We  have 
tried  this  in  a  number  of  two-dimensional  simulations  and  figures  6-8  show  some 
typical  results.  In  each  case  we  started  the  refinement  with  totally  random  phases  and 
refined  for  20  cycles  with  entropy  Si  at  i?  =  10.  Although  the  final  map  usually  had  a 
few  spurious  peaks,  invariably  the  structure  could  be  identified,  with  at  most  one  or  two 
atoms  not  seen.  As  a  more  objective  test  of  the  power  of  the  method,  we  carried  out  a 
few  “blind  tests”  of  the  following  kind.  One  of  us  (Narayan)  generated  a  model  and 
produced  a  list  of  structure  amplitudes  along  with  stereochemical  information  on  the 
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Figure  6.  The  true  structure  is  the  same  as  in  figure  5.  Random  phases  were  given  to  the  true 
structure  amplitudes  and  refined  using  the  mem.  Eleven  out  of  12  atoms  can  be  fitted  on  peaks 
in  the  refined  map.  The  atom  marked  A  is  absent  with  no  nearby  peak,  while  the  peak  S  is  a 
strong  spurious  feature.  This  is  a  typical  result  obtained  with  the  mem  in  two-dimensional  ab 
initio  structure  analysis.  Note  the  origin  shift. 


Figure  7.  The  true  structure  consists  of  2  molecules,  of  8  equal  atoms  each,  with  the 
orientations  and  relative  spacings  shown.  The  mem  map  obtained  by  refining  random  starting 
phases  reveals  most  of  the  structure  except  the  absent  atom  A.  There  is  one  strong  spurious 
peak  S. 
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Figure  8.  The  true  structure  consists  of  4  molecules,  of  5  equal  atoms  each,  in  cm  symmetry. 
The  MEM  map,  refined  from  random  starting  phases,  without  including  the  information  on  the 
mirror,  shows  all  the  10  independent  atoms  and  has  no  strong  spurious  peaks. 


“molecule”.  Another  (Vani),  given  a  map  with  random  phases,  would  refine  them  by 
means  of  the  mem  and  try  to  interpret  the  resulting  map.  In  all  the  three  cases  where  this 
was  tried  the  structure  was  solved  correctly — usually,  one  or  two  atoms  did  not  appear 
in  the  mem  map,  but  these  could  be  easily  located  by  standard  Fourier  techniques 
(Ramachandran  and  Srinivasan  1970).  Because  we  start  from  random  phases  in  our 
calculations,  there  is  invariably  an  origin  shift  in  the  solution  and  in  50  %  of  the  cases 
there  would  also  be  an  enantiomorph  reversal. 

A  point  to  be  emphasized  is  that  all  our  calculations  were  carried  out  in  pi  symmetry 
which  is  considered  the  most  difficult.  Figure  8  shows  an  interesting  case  where  the  true 
structure  has  cm  symmetry  but  we  did  not  use  the  information  on  the  mirror  during  the 
refinement.  Thus,  although  this  structure  has  only  five  independent  atoms,  we  solved  it 
as  if  it  was  a  10-atom  structure  in  cl  symmetry.  It  is  interesting  that  the  mirror 
symmetry  comes  to  light  during  the  structure  solution,  though  not  perfectly. 

Another  interesting  feature  is  that  in  all  our  calculations  we  directly  used  the 
structure  amplitudes  \F^^\  without  any  modification.  In  the  direct  methods  a  crucial 
prerequisite  is  “sharpening”  of  the  data  to  obtain  so-called  £’s,  corresponding  to  a 
point  atom  structure.  In  our  experience,  the  mem  works  very  well  with  F’s  and,  in  fact, 
does  not  refine  so  well  when  F’s  are  used. 


6.  Three-dimensional  refinement — real  structures 

Encouraged  by  the  results  in  two  dimensions  we  have  very  recently  embarked  on  three- 
dimensional  structure  determination  using  the  mem.  Due  to  memory  limitations  in  the 
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mini-computer  (PDP  11/34)  with  which  we  worked,  we  used  al6xl6xl6  unit  cell  for 
the  electron  density,  corresponding  to  {h,  k,  t)  ranging  from  —  7  to  -1-8.  Once  again  all 
refinement  was  carried  out  in  PI  symmetry  using  the  gradient  method.  Whereas  in  two 
dimensions  the  entropy  Sj  appears  to  be  marginally  better  than  S' 2 ,  we  find  the  opposite 
to  be  true  in  three  dimensions.  Hence  all  the  results  described  here  correspond  to  the 
entropy 

S'2-  -|{[Ci+p(r)]ln[Ci+p(r)]  +  [C2-p(r)]ln[C2-p(r)]}dl^,.  (18) 

The  constant  was  adjusted  to  maintain  P  =  10  while  C2  was  taken  to  be  slightly 
greater  than  the  theoretical  height  of  the  strongest  atom.  The  refinement  is  typically 
slower  than  in  two  dimensions,  needing  of  the  order  of  50-100  cycles,  though  once 
again  most  of  the  gain  is  achieved  early  on. 

To  begin  with  we  generated  a  model  structure  with  15  equal  atoms  in  PI  symmetry 
and  refined  the  phases  using  the  mem,  starting  from  random  values.  The  electron 
density  after  60  cycles  of  refinement  was  scanned  for  peaks  and  these  were  listed  in 
decreasing  order  of  magnitude.  The  structure  was  very  clearly  identified  from  the  peaks. 
Among  the  first  15  peaks,  10  belonged  to  atoms  in  the  structure  and  5  were  spurious. 
Among  the  10  correct  peaks,  3  {viz  peaks  2, 3, 4  in  the  order  of  ranking)  were  elongated 
and  had  a  second  atom  in  the  tail  region.  This  is  a  problem  we  faced  in  all  our  three- 
dimensional  calculations — because  of  the  lack  of  resolution  (since  we  have  only  16  grid 
points  along  each  axis),  peaks  sometimes  merged  and  two  atoms  would  be  represented 
by  a  single  elongated  peak.  In  such  cases,  usually  one  atom  is  close  to  the  strongest 
region  of  the  peak  and  the  other  weakly  present  in  the  tail.  Thus,  in  our  simulated  15- 
atom  structure,  the  first  15  peaks  accounted  for  13  atoms  and  in  fact  the  first  8  peaks 
were  all  correct  and  contained  1 1  atoms.  The  final  2  atoms  in  the  structure  were  at  peaks 
23  and  26.  We  feel  that  the  performance  of  the  mem  is  very  encouraging. 

Emboldened  by  this  we  have  tried  to  “solve”  a  few  real  structures  using  real 
experimental  data.  Unfortunately,  we  could  not  find  in  the  literature  any  suitable  equal- 
atom  PI  structure  whose  data  would  fit  within  our  16^  grid.  As  the  next  best  thing,  we 
selected  a  few  structures  in  Pi  symmetry  and  refined  them  in  PI  symmetry,  ignoring  the 
centre  of  symmetry  altogether. 

Martuscelli  (1969)  has  reported  the  crystal  structure  of  octa-trans-2,  trans-6-diene-l, 
8-dioic  acid  which  crystallizes  in  PI  symmetry  with  unit  cell  parameters  a  =  12  00 A, 
b  =  3-94  A,  c  =  7-72  A,  a  =  121-35°,  jS  =  139-4°,  y  —  70-05°.  He  solved  the  structure 
from  about  500  observed  reflections  using  model-fitting  procedures  based  on  the 
Patterson  maps.  Because  the  cell  parameter  a  is  rather  large,  our  calculations  on  this 
structure  were  hampered  by  the  extremely  poor  resolution  (0-75  A)  in  this  direction. 
Also,  we  could  include  only  reflections  with  h  in  the  range  —  7  to  -1-8  which  reduced  the 
number  of  structure  amplitudes  we  could  use  to  326.  In  spite  of  these  limitations,  the 
structure  refined  quite  easily  from  random  phases.  After  100  cycles  of  refinement,  the 
first  12  peaks  in  the  map  included  all  the  12  atoms  of  the  structure.  Peaks  7  and  9  were 
spurious  while  peaks  1  and  2  included  2  atoms  each.  This  is  very  gratifying  since  we  had 
very  limited  data  and  severe  resolution  problems. 

Our  next  attempt  on  a  much  larger  equal  atom  structure  was  not  so  spectacular 
though  still  rather  encouraging.  Karle  (1969)  has  reported  the  structure  of  a  thymine- 
thymine  adduct  (obtained  from  irradiated  thymine).  This  is  again  in  Pj symmetry  with 
38  atoms  in  the  unit  cell  (2  x  19  independent  atoms);  a  =  9-44  A,  h  =  8-29  A,  c 
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=  7-57  A,  a  =  99  0°,  ^  =  91-5°,  y  =  89-8°.  The  data  consisted  of  1844  independent 
reflections  and  the  centrosymmetric  structure  was  originally  solved  by  obtaining  the 
structure  factor  signs  using  the  symbolic  addition  procedure  (Karle  and  Karle  1966). 
We  attempted  to  solve  the  38  atoms  of  the  structure  in  PI  symmetry  with  the  mem, 
using  1370  reflections  which  fitted  within  our  grid.  After  100  cycles  of  refinement 
starting  from  random  phases,  we  found  that,  of  the  first  38  peaks,  18  were  correct, 
including  22  atoms  (peaks  1,  6, 1 1  and  14  had  two  atoms  each),  while  the  first  80  peaks 
had  25  correct  peaks  with  31  atoms;  7  atoms  had  no  peaks  at  all  in  the  map.  Of  the  two 
molecules,  one  was  seen  quite  clearly  while  the  other  was  rather  weak.  The  refinement 
has  clearly  not  been  too  successful;  on  the  other  hand,  we  believe  we  could,  if  necessary, 
have  solved  the  structure  from  our  map  with  a  little  patience  if  the  structure  were  not 
already  known.  Karle  (1969)  used  6  symbols  for  which  a  unique  assignment  of  signs  was 
indicated  by  the  symbolic  addition  method  and  the  first  19  peaks  of  the  resulting  £-map 
showed  the  19  independent  atoms  in  the  structure — a  truly  remarkable  performance. 
However,  considering  that  (i)  we  had  to  use  less  data,  (ii)  we  had  poor  resolution  in  the 
electron  density  map,  (iii)  we  refined  38  independent  peaks,  (iv)  we  worked  in  a  non- 
centric  framework  and  (v)  we  have  not  really  optimised  our  refinement  procedure,  we 
feel  our  results  using  the  mem  are  quite  good.  We  may  point  out  that  a  structure  with  38 
atoms  in  PI  is  not  trivial  even  today  and  would  probably  have  been  considered  quite 
difficult  a  decade  back. 

As  a  final  example  we  attempted  a  simple  heavy  atom  structure.  Dutta  and  Woolfson 
(1969)  solved  the  structure  of  tetraethyl  diphosphine  disulphide,  which  crystallizes  in  Pi 
symmetry,  by  direct  methods  from  two  projections  using  two  symbols  in  each.  The  unit 
cell  has  12  atoms  (6  independent  atoms  if  one  uses  the  PI  symmetry),  of  which  2  sulphur 
and  2  phosphorus  atoms  are  heavy,  while  the  other  8  are  (light)  carbon  atoms.  The  unit 
cell  parameters  are  a  —  8  98  A,  b  =  6-45  A,  c  =  6T5  A,  a  =  11 3  0°,  ^  —  85-2°, 
y  =  102-5°.  Of  the  864  reflections  observed,  we  could  accommodate  only  760  within  our 
grid  and  the  phases  of  these  were  refined  as  usual  with  the  mem  .  After  70  cycles  of 
refinement  the  first  12  peaks  contained  10  atoms  with  2  spurious  peaks  (8  and  10  in 
ranking).  The  first  4  peaks  were  the  4  heavy  atoms  and  the  two  atoms  that  were  missing 
were  the  weakest  in  the  final  solution,  having  heights  only  ~  18  %  of  the  sulphur  atoms. 
The  success  of  the  mem  with  this  structure  is  interesting  as  it  shows  that  the  method  is 
equally  applicable  to  heavy  atom  structures  (as  indeed  the  direct  methods  also  are, 
although  they  were  originally  developed  for  equal  or  near-equal  atom  structures). 

In  the  various  structure  determinations  described  above,  there  was  nothing  special 
about  the  starting  random  phases  which  could  have  helped  in  the  refinement.  We  have 
tried  different  sets  of  random  phases  and  usually  obtained  comparable  results.  Also, 
although  we  refined  for  ~  1(X)  cycles  in  all  the  cases,  most  of  the  refinement  is  over 
within  the  first  20  cycles  and  the  later  calculations  only  help  to  “sharpen”  the  peaks  and 
define  then  better.  Finally,  we  should  mention  that,  as  in  two  dimensions,  here  again  we 
obtained  much  better  results  using  the  P’s  rather  than  P’s. 


7.  Discussion  and  conclusions 

The  phase  problem  of  crystallography  is  probably  the  most  difficult  and  challenging 
among  the  variety  of  Fourier  inversion  problems  known  in  several  fields.  The  success  of 
the  MEM  in  other  fields  in  different  contexts  is  therefore  no  guarantee  of  success  when 
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applied  to  crystallography.  Although  we  and  a  few  others  have  been  excited  about  the 
prospects  of  the  mem  in  crystallography,  there  was  until  recently  only  limited  reason  for 
optimism.  Firstly,  we  (as  well  as  Britten  and  Collins  1982)  had  shown  that  the  mem  is 
related  to  the  maximum  determinant  method  of  crystallography;  we  had  further  argued 
that  the  mem  phase  relations  (equations  (10))  are  higher  order  phase  relations  which  are 
likely  to  be  more  powerful  than  the  lower  order  relations  (triplets,  quartets,  etc.) 
conventionally  used  and  that  equal  atom  structures  satisfy  the  mem  relations.  Secondly, 
Collins  (1982)  had  shown  that  phase  refinement  is  feasible  in  macromolecular 
crystallography,  starting  from  approximately  known  phases. 

We  believe  that  the  results  presented  in  this  paper  greatly  improve  the  case  for  trying 
the  MEM  in  crystal  structure  analysis.  We  have  shown  that  the  method  works  well  in 
small-  and  medium-sized  real  crystals  and  has  the  ability  to  locate  a  high  percentage  of 
the  atoms  starting  from  random  phases  for  the  structure  factors.  It  is  worth  stressing 
once  again  that  all  our  work  has  been  in  PI  symmetry,  which  is  usually  considered  the 
most  difficult. 

A  comparison  of  mem  refinement  (as  we  implement  it)  with  conventional  direct 
methods  shows  two  important  differences. 

(a)  In  the  direct  methods  one  works  with  £’s  and  solves  for  point  atoms.  On  the 
other  hand,  we  find  that  the  mem  works  best  with  P’s.  It  is  possible  that  this  difference 
arises  because  the  mem  ,  apart  from  looking  for  peaks  as  do  the  direct  methods,  is  also 
concerned  about  the  flatness  of  the  baseline.  Since  an  £-map  is  likely  to  have  a  lot  of 
ripple  even  with  correct  phases  (due  to  data  truncation  and  experimental  errors),  this 
might  be  a  possible  explanation. 

(b)  In  the  direct  methods,  the  multisolution  approach  is  the  rule  where  a  small  set  of 
primary  reflections  are  systematically  given  various  phases.  One  set  then  produces  the 
structure  while  the  others  are  usually  uninterpretable.  In  our  tests  so  far  we  have  found 
that,  regardless  of  the  starting  set  of  random  phases,  the  structure  usually  comes  up  to 
about  the  same  level  of  interpretability.  It  is  possible  that  we  are  still  working  with 
structures  that  are  too  small  and  that  the  mem  also  may  need  a  multisolution  approach 
with  larger  structures.  On  the  other  hand,  there  is  a  distinct  possibility  that,  because  of 
its  higher  order  phase  relations,  the  method  is  able  to  attract  the  solution  towards  the 
correct  structure  from  a  wider  range  of  starting  phases. 

Because  of  the  above  reasons  we  feel  that  the  mem  is  not  just  the  direct  methods  in  a 
new  disguise  but  is  distinctly  different.  As  such  it  merits  further  study  in  more 
complicated  situations.  The  ability  to  work  with  P’s  rather  than  P’s  might  turn  out  to 
be  a  big  advantage  since  the  method  will  be  less  susceptible  to  experimental  errors.  In 
our  view  the  following  further  questions  now  need  urgent  attention. 

(a)  Although  the  gradient  method  we  have  employed  is  useful  for  the  sort  of 
preliminary  studies  we  have  made,  the  future  probably  belongs  to  fixed  point  schemes 
{e.g.  Sayre  1980)  which  directly  seek  to  satisfy  the  mem  conditions  (10).  Not  only  would 
the  method  then  easily  work  for  centrosymmetric  strucutres,  there  is  also  the  real 
advantage  that  one  can  use  one’s  a  priori  information  on  the  shapes  of  atoms  through 
the  S„,  in  (M)  to  determine  the  n^j  in  (1 1).  This  information  cannot  be  introduced  into 
the  gradient  algorithm. 

(b)  One  needs  sensitive  figures  of  merit  to  measure  the  goodness  of  the  refinement, 
particularly  if  the  mem  is  to  be  combined  with  the  multisolution  approach. 

(c)  Techniques  such  as  isomorphous  replacement  and  anomalous  dispersion  are 
employed  to  give  added  information  in  macromolecular  phase  refinement  and  these 
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could  be  put  in  variationally  into  the  mem  .  For  instance,  in  isomorphous  replacement, 
one  knows  that  the  electron  density  distribution  in  a  protein  and  its  derivatives  are 
identical  except  at  the  locations  of  the  replaced  atoms  where  the  differences  are  known. 
So  too,  in  the  anomalous  dispersion  case,  where  the  electron  density  becomes  complex, 
one  has  the  powerful  a  priori  information  that  the  peaks  in  the  imaginary  part  occur 
only  at  the  centres  of  the  peaks  in  the  real  part.  One  technique  to  introduce  these  extra 
constraints  is  to  maximise  the  integral  0f/(Pderivative  -Pprotein)  and/(Preal  -  Amaginary  )> 
which  is  equivalent  to  introducing  the  a  priori  input  that  (pderivatWe  —  Pprotein )  has  a  flat 
baseline  with  sharp  well-resolved  peaks  and  that  (Preai  “Amaginary)  has  no  negative 
peaks.  These  and  other  similar  ideas  deserve  to  be  pursued,  particularly  since  Collins 
(1982)  has  obtained  interesting  results  even  with  just  the  bare  mem. 
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Appendix 


We  present  here  an  illustrative  example  in  one  dimension  bringing  out  the  crucial  role 
played  by  the  zero  Fourier  coefficient  Fq.  We  consider  a  periodic  real  function  in  one 
dimension  which  is  sampled  at  4  points  in  the  unit  cell  so  that  the  function  is  completely 
described  by  its  values  p_i,  po,  Pi,  P2  (the  origin  has  been  taken  at  the  second  point). 
The  Fourier  coefficients  are  also  periodic.  As  the  p’s  are  real,  Fq  and  F 2  are  real  and 
FI  1  =  Fi  =  |Fi  I  exp  {i(Pi)  are  the  only  structure  factors  with  a  continuously  variable 
phase;  even  here,  if  we  eliminate  physically  identical  solutions  (related  by  an  origin  shift 
or  enantiomorph  reversal),  the  range  of  cpi  from  0  to  njl  completely  describes  all 
distinct  solutions  compatible  with  a  given  set  of  amplitudes  Fq,  |Fi|,  F2. 

Consider  the  centric  “structure”  given  by 

po  -  2-0,  p_  1  =  Pi  =  0-5,  P2  =  0,  (Al) 


whose  Fourier  transform  gives 


Fq  -  1-0,  [Fi  I  =  0-75,  (Pi  -  0°,  F2  -  0-5. 
We  investigate  the  variation  of  the  entropy 


S,(C,(Pi)  = 


In  [C  4-p(<Pi)]dx 


J 


Z  ln[C  +  A(7’i)]> 
j=  -1 


(A2) 

(A3) 


where  Pj(<Pi)  are  calculated  with  the  true  amplitudes  and  the  phase  (pi .  Adding  C  to  all 
the  pj  is  equivalent  to  replacing  Fq  by  Fq  -I-  C.  Figure  9  shows  the  variation  of  Si  with  (p^ 
for  different  values  of  C.  At  high  values  of  C,  the  maximum  value  of  Si  is  seen  to  occur  at 
the  correct  phase,  (Pi  =  0.  However,  at  low  values  of  C  the  maximum  shifts  away  from  0 
and  at  C  =  0  02,  for  instance,  occurs  at  cpi  ~  10°.  Thus,  at  low  values  of  C,  the  mem 
solution  is  non-centric  even  though  the  data  correspond  to  a  centric  structure. 
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Figure  9.  Variation  of  entropy  Sj  as  a  function  Figure  10.  Variation  ofentropySj  as  a  function 

of  phase  (/)  ]  for  the  centrosymmetric  structure  (A  1 ),  of  phase  (p  i  for  the  non-centrosymmetric  structure 

{A2)  for  various  “DC  offsets”  C.  Note  that  the  (/14),  (.45)  for  various  “DC  offsets”  C.  Note  that  the 

maximum  entropy  solution  at  small  C  is  non-  maximum  entropy  solution  at  large  C  is  centric 

centric  ((,£),  ^  0).  (<p,  =  0). 


Consider  next  the  non-centric  structure 

p_i  =  0,  po  =  3  0,  Pi  =  1-0,  p2  =  (A4) 

Fo  =  1-0,  |Fi  I  =  0-7906,  (p^  -  18-43°,  =  0-5.  (A5) 

The  variation  of  the  entropy  Sj  for  this  case  is  shown  in  figure  10.  At  low  values  of  C, 
the  maximum  value  of  occurs  close  to  the  true  phase  e.g.  for  C  =  0-05,  we  obtain  (pi 
~  18°.  However,  as  C  increases,  the  maximum  shifts  to  lower  values  of  tpi  until  above  a 
critical  value  of  C  the  maximum  is  at  (pi  =0,  i.e.  a  centric  structure.  This  tendency  of  a 
non-centric  structure  to  move  towards  a  centric  solution  at  high  C  (or  low  R)  is  also 
discussed  in  §  3  with  a  two-dimensional  example. 

The  above  results  suggest  that  the  value  of  C  should  be  set  by  the  user  depending  on 
prior  knowledge  of  the  presence  or  absence  of  a  centre  of  symmetry.  We  have  found  by 
experience  that  very  low  values  of  C  lead  to  slow  convergence  while  very  high  values  of 
C  can  cause  round-off  errors  in  the  calculations.  In  our  work,  with  these  two  auxiliary 
constraints  in  mind,  we  constantly  update  C  at  each  cycle  so  as  to  maintain  the  non¬ 
linearity  parameter  i?  at  a  value  ~  10.  This  has  given  good  results  with  both  centric  and 
non-centric  structures. 
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Abstract.  The  primary  extinction  factor  is  defined  as  the  ratio  of  the  integrated  reflection 
from  a  coherently  diffracting  domain  to  the  integrated  kinematical  reflection  from  the  same 
domain.  When  is  larger  than  05  it  may  be  approximated  by  =  exp{  —  (a<5)^},  where  a  is 
about  0-5  and  b  the  average  size  of  the  coherent  domain  when  measured  in  units  of  the 
extinction  length  A,  5  =  D/A. 

Transfer  equations  are  applied  to  symmetrical  Laue  diffraction,  and  the  reflectivity  per 
unit  length,  a(e)  is  solved  from  the  measured  reflecting  ratio  as  a  function  of  the  rocking  angle 
e  =  9  —  9  g. 

Measurements  with  conventional  x-ray  sources  are  made  on  single  crystal  slabs  of  Be  and  Si 
using  AgK^,  MoK*!  and  CuKa  radiation.  The  primary  extinction  factor  y^fe)  is  solved  from  a 
point-by-point  comparison  of  two  measurements  where  the  extinction  length  A  is  changed  by 
varying  the  polarization  and/or  wavelength  of  the  x-ray  beam.  The  results  show  that  primary 
and  secondary  extinction  are  strongly  correlated,  and  that  the  customary  assumption  of 
independent  size  and  orientation  distributions  of  crystal  mosaics  is  unjustified.  The  structure 
factors  for  Be  and  Si  show  close  agreement  with  other  recent  measurements  and  calculations. 

The  limitations  of  the  method  are  discussed  in  length,  particularly  the  effects  of  beam 
divergences  and  incoherence  of  the  rays  in  the  crystal.  It  is  concluded  that  under  typical 
experimental  conditions  the  requirements  of  the  theory  are  met.  Practical  limitations  arising 
from  the  use  of  characteristic  wavelengths  and  unpolarized  radiation  prohibit  the  use  of  the  full 
potential  of  the  method. 

The  properties  of  a  synchrotron  radiation  source  are  compared  with  a  conventional  x-ray 
source,  and  it  is  demonstrated  that  the  experimental  limitations  can  be  removed  by  the  use  of 
synchrotron  radiation.  A  diffraction  experiment  with  synchrotron  radiation  is  outlined,  as  well 
as  generalization  of  the  method  to  small  spherical  crystals. 

Keywords.  Extinction;  synchrotron  radiation. 


1.  Introduction 

In  his  paper  “The  reflection  of  x-rays  from  imperfect  crystals”  Darwin  (1922)  writes 
about  the  theory  of  extinction  which  he  is  going  to  introduce;  “I  am  afraid  I  have  not 
succeeded  in  welding  the  parts  of  the  argument  rigorously  together,  but  in  spite  of 
certain  gaps  in  the  theory  it  seems  unlikely  that  there  is  serious  error  in  the  general  views 
to  which  it  leads”.  After  more  than  60  years,  our  understanding  of  extinction  is  still 
based  on  Darwin’s  views,  on  the  concepts  of  primary  and  secondary  extinction.  In  the 
main  line  of  crystallography,  correction  schemes  have  been  incorporated  in  procedures 
for  structure  determination,  but  in  a  sense  this  has  been  a  step  backwards,  as  the 
extinction  calculations  are  based  on  the  energy  transfer  equations,  which  cover 
secondary  extinction  only  (Hamilton  1957;  Zachariasen  1967;  Cooper  and  Rouse  1970; 
Coppens  and  Hamilton  1970;  Becker  and  Coppens  1974a,  b,  1975).  Moreover,  the 
extinction  corrections  are  derived  by  least  squares  fitting  to  a  crystal  model  which 
includes  theoretical  structure  factors  and  extinction  parameters.  These  are  highly 
correlated,  and  although  the  extinction  correction  greatly  improves  the  average  quality 
of  the  diffraction  data  it  may  also  introduce  artefacts. 


69 


70 


P  Suortti 


Darwin’s  paper  was  purported  to  provide  understanding  of  the  experimental  results 
by  Bragg  et  a/  (1921,  abbreviated  hereafter  as  bjb).  Since  those  days,  there  has  always 
been  a  line  of  direct  experimental  approach  to  the  problem  of  extinction.  The 
kinematical  limit,  where  the  feedback  between  the  direct  and  reflected  beams  can  be 
ignored,  has  been  approached  in  various  ways.  These  include  methods  where  the  active 
volume  of  the  crystal  or  the  reflectivity  per  unit  volume  are  reduced  (for  references,  see 
Mathieson  1979). 

Some  of  the  basic  ideas  of  the  present  work  can  be  found  already  in  Darwin’s  paper 
where  he  writes:  “The  methods  used  in  bjb  remove  the  secondary  extinction,  but  are 
without  influence  on  the  primary.  In  fact,  it  will  appear  that  no  experiments  of  the 
present  type  can  possibly  remove  it;  indeed,  to  do  so  would  require  the  measurement  of 
the  actual  sizes  of  the  small  blocks  of  perfect  crystal.”  .  .  .  “Now,  at  each  instant  of  the 
rotation  there  will  be  a  different  amount  reflected  and  therefore  a  different  extinction, 
and  consequently  it  will  not  suffice  to  treat  of  the  mean  effect  of  extinction,  without  first 
determining  it  at  every  setting  of  the  crystal.”  In  the  following  we  will  see  that  the 
degrees  of  both  primary  and  secondary  extinctions  are  correlated  to  the  reflection 
profile  and  each  other,  and  that  when  the  actual  sizes  of  the  coherent  domains  are 
expressed  in  units  of  extinction  length  the  sizes  can  be  determined  from  two 
measurements  and  the  effects  of  primary  extinction  removed. 

The  details  of  the  method  have  been  described  earlier  (Suortti  1982a,  b),  and  the  aim 
of  this  paper  is  to  outline  the  method,  give  some  new  results,  discuss  the  limitations  of 
the  theory  and  those  of  an  experiment  with  a  conventional  x-ray  source,  and  finally 
describe  how  the  full  potential  of  the  method  is  utilized  by  the  use  of  synchrotron 
radiation. 

2.  Transfer  equations 

In  the  following  we  will  assume  that  the  radiation  from  the  x-ray  source  is  incoherent. 
For  a  conventional  x-ray  tube  this  is  obvious,  and  even  with  a  synchrotron  radiation 
source  the  possible  effects  of  wave  coherence  are  small.  Consider  the  two  waves  in 
figure  1,  which  illustrates  crystal  domains  with  constant  but  independent  lattice  phases 
G.  When  wave  (1)  enters  domain  (a)  at  an  angle  9,  which  is  close  enough  to  a  Bragg  angle 
dg  —  6  — z,  two  waves  of  electric  displacements  Dj  and  are  excited,  and  these  have  a 
constant  phase  difference  within  (a),  i.e.  the  amplitudes  of  the  waves  are  coupled.  The 
resulting  decrease  of  Dg  is  called  primary  extinction.  On  the  other  hand,  the  difference 
between  and  is  arbitrary,  the  intensities  from  (a)  and  (b)  add  up.  However,  domain 
(b)  is  screened  by  diffraction  in  (a),  in  addition  to  the  normal  absorption,  and  this  loss 
due  to  intensity  coupling  is  called  secondary  extinction.  Also  the  second  diffraction  of 
Dg  in  domain  (c)  decreases  the  reflected  power  P* .  The  flow  of  energy  between  and 
Pg  is  covered  by  the  energy  transfer  equations, 

dP^/dsj  =  +  a_gP*,  (la) 

dP*ldSg^  -^ilP*  +  agP^,  (lb) 

where  (sj,  Sg)  are  oblique  coordinates  along  the  direct  and  diffracted  beams,  respect¬ 
ively.  If  we  assume  a  non-polar  crystal,  ag  =  o_g,  and  ignore  the  possibility  of  the 
Borrmann  effect, 

f^e  =  90  + 


(2) 
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(1)  (2) 


Figure  1.  Diffraction  from  coherent  domains.  Within  each  domain,  the  lattice  phase  G  is 
constant,  but  the  differences  between  G„,  and  are  arbitrary.  and  are  the  electric 
displacements  of  the  direct  and  reflected  beams  excited  in  domain  (a).  The  area  of  coherent 
coupling  for  the  ray  entering  (a)  at  A  and  exiting  at  B  is  indicated  by  and  Sg,  and  the  total  area 
of  wave  coherence  with  dots.  The  shaded  areas  of  (b)  and  (c)  show  incoherent  coupling  of  the 
direct  and  reflected  beams  of  rays  (1)  and  (2). 


where  o  —  <t{e,s^,  Sg)  is  the  reflectivity  per  unit  length  of  the  traversed  crystalline 
medium  and  piQ  the  linear  absorption  coefficient.  The  observable  reflectivity  a{e)  is  the 
integral  of  (7{e,  s^,  Sg)  over  the  region  of  interaction  for  the  extended  incident  beam,  and 
the  integrated  intensity  is  found  by  an  angular  integration. 


3.  Primary  extinction 

The  transfer  equations  for  two  waves  that  have  amplitude  coupling  were  introduced  by 
Takagi  (1962,  1969)  and  Taupin  (1964): 

dDJdsj  =  iK_gDg  exp{iG{Sj,  s^)}, 
dDg/dSg  =  iKgOa  exp{  -  iG(Sd,  s^)}, 

where  the  coupling,  coefficient  is 

Kg^r,ACFg/K.  (3c) 

Here  C  is  the  polarization  factor  (cos  20  when  the  electric  vector  is  in  the  plane  of 
diffraction,  1  when  perpendicular  to  that),  X  the  x-ray  wavelength,  r^  —  e^jmc^  the 
electron  scattering  length,  the  unit-cell  volume,  and  Fg  the  structure  factor.  The 
lattice  phase  G  —  2ng  ■  u,  where  g  is  the  scattering  vector  and  u  the  lattice  distortion. 

The  two  waves  Dj  and  Dg  stay  coherent  within  the  domain  where  G  is  constant.  As 
seen  in  figure  1,  the  maximum  area  of  coherent  coupling  is  s^Sg  sin  20,  and  the 
appropriate  average  is  found  by  an  integration  over  the  exit  surface.  The  “size”  of  the 
coherent  domain  may  be  defined  as  the  average 

D  =  (s^)i  (4) 

Explicit  calculation  is  possible  for  a  given  cross-section  of  a  perfect  crystal,  and 
detailed  results  are  available  for  square  and  circular  shapes  (Olekhnovich  and 
Olekhnovich  1978, 1980).  If  the  incoming  ray  is  taken  as  a  plane  wave,  reflectivity  curves 
R{e)  can  be  calculated;  an  example  is  illustrated  in  figure  2.  The  detailed  form  of  R{e) 


(3a) 

(3b) 
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R(e-r\) 


Figure  2.  Regularly  shaped  coherent  domains  of  size  D,  and  typical  reflectivity  curves  of  a 
domain  for  j)erpendicular  and  parallel  polarizations  of  the  incident  plane  wave. 


depends  on  the  crystal  shape,  wavelength  and  polarization,  but  the  integral  of  K(e)  is 
well  defined  by  k^D,  and  this  is  the  basis  of  the  extinction  correction  to  be  introduced  in 
the  following. 

The  integrated  reflectivity  due  to  domains  of  size  D  is  (Suortti  1982a) 


^  Wo(£) 


R^ie-r})ds  =  Qyp{D)w^{e). 


J 

Ae 


(5) 


Here  r]  is  the  deviation  of  domains  from  the  average  orientation,  Wp(e)  is  the  distribution 
function  of  domain  sizes  (which  is  supposed  to  be  slowly  varying  in  comparison  with 
R[)),  Q  =  (2/sin  203)1  I ^  is  the  integrated  kinematical  reflectivity  per  unit  length  (or 
integrated  reflecting  power  per  unit  volume;  see  James  1962),  and  yp(D)  <  1  is  the 
primary  extinction  factor.  The  observable  reflectivity  is 


<7(£)  =  C  J  yp(f>)Wo(£)d£).  (6) 

The  integrated  reflectivity  per  unit  length  of  the  crystallite,  Q'  =  Qyp{D),  can  be 
expressed  in  terms  of  low-order  Bessel  functions  J2„-  The  argument  of  the  functions  is 

S  =  K^H{^^)i=D/A,  (7) 

where  A  =  (/Cg//)~  ^  =  VJ{rgkCF gli)  is  the  extinction  length.  Here  H  is  the  long-range 
order  parameter,  which  for  a  perfect  crystal  is  the  Debye-Waller  factor  exp(  —  M).  The 
above  means  that  the  arguments  of  the  intensity  expressions  become  linear  in  the  size  of 
the  coherent  domain,  when  this  is  measured  in  the  units  of  A.  The  effective  size  S  can  be 
varied  by  changing  2,  C  or  Fg,  and  this  gives  an  experimental  possibility  for 
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determination  of  primary  extinction. 

At  small  enough  values  of  6  it  follows  from  the  properties  of  J2„  that 

yp(^)  =  272  ^  ‘  I (8) 

where  the  numerical  value  of  a  is  about  0-5.  Model  calculations  (Olekhnovich  and 
Olekhnovich  1978,  1980)  suggest  that  this  approximation  is  valid  when  ^  <  1-5  or 
yp  >  0-5.  It  is  worth  noting  that  the  same  functional  form  can  be  recovered  from 
Darwin’s  (1922)  original  work, 

(^)^tanh-^^  y^Asin^)2} 

the  extra  angular  factor  (sin  0)'  ^  arises  from  the  special  geometry  of  a  laterally  infinite 
plate. 

The  primary  extinction  factor  at  a  given  average  angle  of  incidence  is  weighted  by  the 
distribution  of  domain  sizes. 


yp(e)w(£)  = 


(5)w{^(£)}d5  =  e-i(T(e). 


(9) 


If  we  assume  that  w{5(£)}  is  a  Gaussian  of  width  2A^,  centred  at  S^,  convolution 
resulting  from  substitution  of  (8)  to  (9)  yields 


>p(£)  =  exp{  -  (a3^)V(l  +  =  exp{  -  (a'^J^}.  (10) 


Although  idealized,  this  calculation  suggests  that  the  functional  form  of  (8)  can  be  used 
also  for  real  crystals. 

The  primary  extinction  factor  can  be  determined  from  two  measurements  of  o{s) 
where  the  extinction  length  is  changed.  Writing  and  S2  for  4  in  (10)  and  assuming 
that  the  effective  crystal  volume  is  the  same  in  both  measurements,  one  obtains  from 
(10) 

/(«)  =  =  exp{-(cc'i,)^(l  - 

yp,2i£) 


22  .  <^i(£) 
2i  <72(e)  ‘ 


(11a) 


The  correction  for  primary  extinction  in  measurement  1  is 

(11b) 

where 

=  (11c) 

The  extinction  length  A  can  be  varied  by  changing  the  polarization  factor  C  or  the 
wavelength  L  A  change  in  2.  usually  entails  a  change  of  C,  and 

=  l-{^2C2/^iC,)\  (12a) 


If  the  polarization  of  the  beam  is  parallel  to  the  plane  of  diffraction  in  one 
measurement,  =  |cos20|,  and  perpendicular  to  this  plane  in  the  other,  Cj  =  1, 

=  \-sqc^29.  (12b) 
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4.  Secondary  extinction 


The  energy  transfer  equations  must  be  solved  under  the  boundary  conditions.  A  simple 
closed-form  solution  is  available  only  for  a  parallel-sided  crystal,  but  an  iterative 
solution  can  be  presumably  found  for  any  simple  polyhedral  crystal,  when  the  so-called 
y4B-extinction  formulas  by  Werner  (1974)  are  used.  The  quantity  determined  exper¬ 
imentally  is  called  the  reflecting  ratio 


r*(e) 


Pti± 


(13) 


where  A  is  the  absorption  factor,  and  P^  „  the  incident  beam.  In  the  situation  illustrated 
in  figure  3,  where  the  crystal  is  bathed  in  the  incident  beam,  determination  of  A  P^  „  may 
require  extensive  subsidiary  measurements.  The  corrected  reflecting  ratio,  where  the 
decrease  of  Pj  due  to  diffraction  is  counted  for,  is  defined  by 


'■*(£)  =  ys(e)''(4 


(14a) 


where  y^{e)  is  the  secondary  extinction  factor.  From  conservation  of  energy,  which  is 
demonstrated  in  the  figure. 


r(e)  = 


r*{e) 

I  —r*  (e) 


(14b) 


The  boundary  conditions  come  into  the  play  when  the  reflectivity  a{e)  is  to  be  solved 
from  r(8). 

Laue  diffraction  from  a  parallel-sided  crystal  plate  is  illustrated  in  figure  4,  and  the 


powers  of  the  reflected  and  direct  beams  can  be  written  as 

P* (e)  =  P^  „  exp(  - dP)  sinh(y -  b  T),  (15a) 

Pdi^)  =  Pd,  o  exp(  -oT)!  {yg/a)Ja^  -  b  cosh(ya2  -bT) 

+  ((Mo  +  o')/2ct)(1  -yghd)  ■  sinh(ya^  -b  T).  (15b) 

The  geometrical  factors  are  explained  in  the  legend  of  figure  4,  and 

«  =  i(Mo  +  o’){l/7g  +  l/)'d).  (15c) 

b  =  +  2no(T)/{ygyd).  (15d) 


The  absorption  factor  for  the  transmitted  beam  is  A  —  exp(  —  ixj' /y^),  and  the  observed 


Figure  3.  Crystal  bathed  in  the  incident  beam.  P*  is  the  measured  power  of  the  direct  beam 
and  P*  that  of  the  reflected  beam,  and  AP^  (,  is  the  transmitted  beam  when  there  is  no 
diffraction.  By  conservation  of  energy,  P*  +  P*  =  APj  g. 


Extinction  correction  and  synchrotron  radiation 


75 


Figure  4.  Laue  diffraction  from  a  parallel-sided  crystal  plate.  The  incident  beam 
extends  from  A  to  C,  the  reflected  from  B  to  D.  The  average  orientation  of  reflecting  domains 
deviates  from  the  plate  normal  by  rjo,  and  following  notations  are  used  =  cos(0 -t- //o)> 
V,  =  cos(0-f;o). 


reflecting  ratio 

r*(e)  =  exp{  - iMo ( 1 /7d  “  !/}'<,) 

X  exp{  -i(Tr(l/7a+  l/y^)}  •  smh(^a^-b  T).  (16) 

The  reflectivity  per  unit  length,  (T(e),  can  be  solved  from  the  experimental  r*(£).  The 
formulas  become  cumbersome,  except  in  the  symmetrical  case  where  yg  =  yj  ~  cos  0, 
and  with  (14), 

r(e)  =  sinh((T7’/cos  6)  ~  (T{e)T /cos  9.  (17) 

Small  deviations  from  symmetry  can  be  considered  as  miscut  of  the  crystal,  and 
corrected  for  by  using  series  expansions  of  (15). 

5.  Experimental  results 

The  method  realizes  Darwin’s  appeal  that  extinction  is  to  be  determined  at  each  setting 
£  of  the  crystal.  This  imposes  stringent  bounds  to  the  experimental  situation:  on  the  one 
hand,  the  divergences  should  not  smear  r*  (£)  or  (t(£),  on  the  other,  yp{D)  is  determined 
only  by  integration  over  Rp(e  —  rj).  Therefore,  it  is  advisable  to  estimate  Ae  in  (5)  and 
compare  it  with  typical  widths  of  reflections. 

We  are  interested  in  the  cases  where  yp{S)  <  0-99,  or  6  >  0-2;  for  smaller  coherent 
domains  the  integral  over  Rp,{e  —  rj)  is  always  Q.  The  total  width  of  R^  is  approximately 
2X/D,  and  at  the  above  limit  Ae^  lOA/A.  Table  1  gives  a  few  typical  values  for  the 
strongest  reflections  of  various  crystals,  and  the  results  indicate  that  A£  is  of  the  order  of 
OT  mrad  for  light  materials. 

The  upper  limit  for  the  divergence  is  due  to  the  requirement  that  r*(£)  should  not  be 
appreciably  smeared  by  A£.  The  possible  minimum  width  of  a  reflection  can  be 
estimated  as  follows:  at  the  kinematical  limit  Q  =  where  ^  is  the  integral  width 

of  the  reflection,  and  in  the  present  geometry,  <7^;^  =  r^^-J{T/cos  9).  The  maximum  value 
of  r,ji„(0)  is  1,  which  corresponds  to  r*(0)  =  0-5,  and  so  =  QT/cos  9.  For  instance, 
for  Be(00-2)  measured  with  MoKa  from  a  IT  mm  thick  crystal  is  0-35  mrad,  and 


76 


P  Suortti 


Table  1.  Extinction  lengths  (in  units  of  /im)  and  widths  of  reflectivity  curves  (in  units  of 
mrad)  of  domains  with  D/A  =  0-2  for  the  strongest  reflections  of  various  crystals  at  a  few  x-ray 
wavelengths  2. 


Refl. 

2  = 

A 

0-561  A 

As 

II 

< 

0-710A 

Ae 

II 

< 

1-542  A 

Ae 

Be(00-2) 

30-5 

0-02 

241 

0-03 

111 

0-14 

LiF(200) 

140 

004 

no 

006 

51 

0-30 

Al(lll) 

12-2 

005 

9-7 

0-07 

4-5 

0’34 

Si(220) 

14-9 

004 

11-8 

006 

5-4 

0-29 

Fe(llO) 

41 

014 

3-2 

0-22 

1-5 

103 

for  Si(220)  of  the  same  thickness  1-5  mrad;  these  values  are  an  order  of  magnitude  larger 
than  As  in  table  1. 

The  above  considerations  determined  the  experimental  arrangement,  which  has  been 
explained  in  detail  earlier  (Suortti  1982b).  In  the  case  of  Be,  the  non-dispersive  (1,  —  1) 
arrangement  of  the  monochromator  and  sample  crystal  could  be  used,  and  in  the  case 
of  Si,  Ka2  component  of  radiation  could  be  eliminated  by  narrow  collimation.  The 
measuring  geometry  is  illustrated  in  figure  5.  MoKaj  and  AgK^  were  obtained  by 
reflection  from  a  Si(220)  flat  monochromator,  which  had  a  rocking  curve  width  of 
OT  mrad.  CuKa  was  selected  by  a  polarizer-monochromator  which  was  based  on 
anomalous  transmission  or  Borrmann  effect.  The  electric  vector  of  the  transmitted 
beam  lies  in  the  diffracting  planes  and  can  be  rotated  by  rotating  the  Ge(220)  Borrmann 
crystal  about  the  axis  of  the  x-ray  beam. 


Figure  5.  Measuring  geometry  in  the  plane  of  diffraction  with  a  Borrmann  polarizer  B  (a 
and  b)  and  reflecting  monochromator  M(c).  The  point  focus  of  the  x-ray  tube  is  indicated  by  F, 
the  sample  by  C,  and  the  scintillation  counter  by  SC.  The  forward  diffracted  beam  from  the 
polarizer  is  used,  and  the  other  beam  is  caught  by  a  beam  stop  shown  in  (a).  The  inserts  in  (a) 
and  (c)  show  the  non-dispersive  setting  when  the  Bragg  angle  of  the  polarizer  or 
monochromator  is  equal  to  that  of  the  sample  crystal. 


Extinction  correction  and  synchrotron  radiation 


77 


The  measurement  on  Be  was  made  with  (almost)  unpolarised  MoKaj  and  with 
polarized  CuKa.  Si  was  measured  with  AgK)?  and  MoKaj.  The  reflections  were  step- 
scanned  in  intervals  of  0-05  to  OT  mrad,  and  the  power  of  the  primary  beam  was 
measured  by  the  same  scintillation  detector.  With  MoKaj  and  AgK)3  an  accurately 
calibrated  attenuator  was  needed.  The  reflecting  ratio  r{e),  which  is  corrected  for 
secondary  extinction,  was  calculated  at  each  angle  of  measurement,  and  was 
determined  from  comparison  of  two  measurements  of  the  same  reflection  when  the 
extinction  length  A  was  varied  through  X  or  C.  The  details  of  this  procedure  have  been 
given  earlier  (Suortti  1982b). 

Some  of  the  results  are  reproduced  in  table  2  together  with  earlier  experimental  and 
theoretical  values,  and  plots  of  a  few  reflections  are  shown  in  figure  6.  The  structure 
factors  of  Be  are  based  on  quite  a  few  measurements  at  different  locations  of  the  crystal, 
and  the  agreement  with  other  recent  values  is  very  good,  although  the  average  primary 
and  secondary  extinction  corrections  reach  30  %,  and  the  peak  values  of  ^  ■“  1 
yp  ^  —  1  are  much  larger.  The  implications  of  the  results  have  been  discussed  in  detail  by 
Larsen  and  Hansen  (1983),  who  conclude  that  the  charge  density  distribution  in  Be 
metal  can  be  interpreted  by  sp^  hybridization,  where  the  orbitals  are  directed  towards 
the  tetrahedral  holes.  The  situation  is  well  described  by  an  lcao  calculation  (Dovesi  et 
al  1982),  although  some  of  the  features  are  exaggerated.  The  measurement  on  Si  is  a  real 
test  case  of  the  method,  as  the  true  values  of  F  (hkl)  for  MoKa  radiation  are  known  very 
accurately  from  the  Pendellosung  measurements  by  Aldred  and  Hart  (1973). 
Considering  that  the  experimental  situation  was  quite  unfavourable,  as  the  crystal  was 
too  thick  and  so  the  extinction  corrections  always  large,  the  results  are  very  satisfactory 
and  substantiate  the  soundness  of  the  method. 

It  is  worth  noting  that  the  degree  of  primary  extinction,  y”  ^(e)  —  1,  follows  closely 
the  reflectivity  curve  when  the  reflection  is  narrow,  while  yp{s)  of  a  wide  reflection  is 
almost  constant.  The  former  behaviour  is  not  covered  by  the  customary  version  of  the 


Table  2.  Experimental  and  theoretical  structure  factors  of  Be  and  Si  at  room  temperature. 


Be: 


hkl 

Present 

HSL 

LH 

DPRR 

10-0 

1-90  ±0-01 

1-83 

1-85 

1-914 

00-2 

3-37  ±0-02 

3-32 

3-37 

3-397 

101 

2-78  ±0-03 

2-83 

2-84 

2-810 

Si: 


hkl 

Present 

AH 

111 

59-60 

60-55 

220 

69-10 

dl-U 

113 

44-90 

43-96 

222 

1.48 

1.35  (calc.),  1.50  (ref.) 

004 

56-70 

56-62 

The  entries  for  Be  include  a  measurement  with  y-rays  (Hansen  et  al  1983),  a  conventional 
measurement  from  a  small  crystal  with  AgKa  and  MoKa  radiations  (Larsen  and  Hansen 
1983),  and  results  of  an  lcao  calculation  (Dovesi  et  al  1982).  The  theoretical  values  have  been 
multiplied  by  the  Debye-Waller  factor  exp{- B  sin^  6/X^),  where  =  0-460  (in  basal 
plane)  and  ^33  =  0-41 5  A^  (perpendicular  to  basal  plane).  The  reference  values  for  Si  are  from 
a  Pendellosung  measurement  by  Aldred  and  Hart  1973. 
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Figure  6a.  Reflecting  ratio  r(e)  and  the  corresponding  correction  for  primary  extinction  as  a 
function  of  the  rocking  angle  e  =  6  —  6gO(  the  Be  crystal.  Open  circles  (O)  give  r(£)  =  r2{s)  with 
CuKa  radiation,  and  the  results  with  MoKa  are  brought  to  the  same  scale  by  multiplying  by 
(Qz/Qi)  (cos  01 /cos  dj);  filled  circles  (•)  indicate  r(£)  =  r,(£)  before  and  crosses  (  x  )  after  the 
correction  for  primary  extinction. 


Figure  6b.  See  the  legend  of  figure  6a. 


Figure  6c.  Reflecting  ratio  of  the  measurements  with  polarized  CuKa  radiation.  The  lowest 
curve  is  the  measured  profile  r*(e)  when  C  =  1,  the  middle  one,  r(e),  is  corrected  for  secondary 
extinction,  and  the  top  curve  is  the  kinematical  reflection  profile  calculated  from  the 
comparison  with  the  measurement  with  C  =  |cos20|.  The  scale  is  the  same  as  in  figure  6a. 
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Figure  6d.  Reflecting  ratios  for  Si(l  1 1)  measured  with  unpolarized  AgK^  (filled  circles,  • ) 
and  MoKaj  (open  circles,  O).  For  details,  see  the  legend  of  figure  6a. 


mosaic-crystal  model,  where  the  orientation  and  size  distributions  are  independent. 
This  implies  a  constant  primary  extinction  over  the  profile,  and  the  rest  of  the  effect  is 
termed  secondary  extinction,  which  is  either  orientation-determined  (type  I)  or  size- 
determined  (type  II)  according  to  Zachariasen  (1967)  (see  the  next  section).  The  present 
results  show,  however,  that  ^^(e)  and  ^^(8)  are  often  strongly  correlated,  and  that 
interpretations  based  on  the  mosaic-crystal  model  may  lead  to  false  conclusions  {cf. 
Olekhnovich  et  al  1980). 

6.  Limitations  of  the  method 

Questions  of  validity  concern  the  underlying  principles  as  well  as  the  practical 
realization  of  the  method.  Some  of  the  problems  have  been  discussed  in  detail  in  earlier 
papers  (Suortti  1982a,  b),  and  will  be  only  mentioned  here. 

6.1  Divergences  of  the  beams 

The  central  question  is  that  of  the  divergences  of  the  beams  inside  the  crystal.  In 
Darwin’s  words:  “.  .  .  each  layer  will,  on  account  of  diffraction,  spread  out  incident 
parallel  rays  into  a  certain  range  of  angles  and  so  will  continually  change  the  angle  at 
which  they  attack  the  successive  layers”.  In  practice,  the  incident  beam  has  certain 
divergences,  and  figure  7  illustrates  how  the  divergences  develop  in  diffraction  by 
coherent  domains.  The  divergence  Ae  is  taken  to  be  large  enough  to  cover  R^ie)  when 
<  0-99.  The  largest  domain  reflects  only  a  narrow  band  of  the  available  rays,  and  its 
contribution  to  the  reflectivity  is  (Tj  (e,  s  +  Ae)  =  Qyp  j ;  in  this  case  the  divergence  of  the 
direct  beam  is  retained,  and  the  g^-beam  has  less  divergence.  This  is  convoluted, 
however,  by  the  orientation  distribution  of  the  domains,  and  the  divergence  of  the 
macroscopic  0-beam  roughly  equals  to  that  of  the  d-beam.  The  second  domain  has  a 
reflectivity  curve  which  fits  the  divergence  Ae,  and  so  d-  and  g(-beams  have  equal 
divergences.  For  this  domain  already  cr(e,  e  + Ae)  ~  Q.  The  third,  very  small  domain 
spreads  out  the  gf-beam,  and  the  successive  reflections  make  the  same  happen  to  the  d- 
beam. 

From  the  above  discussion  it  is  clear  that  when  the  domains  are  large  enough  to 
exhibit  primary  extinction  the  divergence  of  the  incident  beam  can  be  selected  such  that 
it  is  retained  in  the  successive  reflections.  In  that  case  the  energy  transfer  equations  (1) 
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Figure  7.  Reflection  of  a  beam  with  divergence  Ae  from  coherent  domains  of  variable  size. 
In  the  lower  part,  the  width  x  and  divergence  x'(  =  e)  of  the  beam  are  illustrated  as  a  phase- 
space  diagram  together  with  the  reflectivity  curves  of  domains  (1)  and  (2). 


apply,  and  the  effects  of  primary  extinction  on  G{e)  are  covered  by  yp(s). 
If  the  effects  of  primary  extinction  are  negligible, 


Rs(e-r})de  =  Q 


J 


(18) 


for  all  d.  The  average  kinematical  reflectivity  for  the  extended  beam  becomes 


J  J 


R^{e-rj)w^{s)dt]dS  =  R{E-t])w{r])dr], 


(19) 


where  R(e  —  rj)  is  the  average  reflectivity  of  the  domains  oriented  in  an  angle  rj,  and  w{rj) 
is  the  normalized  abundance  of  these  domains.  In  his  theory  of  secondary  extinction, 
Zachariasen  (1967)  distinguishes  two  cases  on  the  basis  of  the  wider,  dominant 
distribution. 


^kin(£)  =  2>v(e),  type  I,  (20a) 

^kin(£)  =  ^(^X  type  II.  (20b) 

Figure  8  illustrates  the  effects  of  diffraction  on  the  divergences  in  the  two  cases.  In  the 
type-I  crystal  the  beams  are  not  substantially  spread,  but  in  a  type-II  crystal  the 
divergences  of  the  beams  cover  the  whole  reflecting  range  already  after  a  few  reflections 
inside  the  crystal.  For  a  type-I  crystal  the  preceding  formalism  applies,  and  it  can  be 
shown  that  the  correction  for  secondary  extinction  would  be  correct  to  the  first  order 
also  for  a  type-II  crystal. 

As  pointed  out  by  Kato  (1980a),  the  problems  arising  from  the  convolution  of  scan 
and  divergences  can  be  avoided  when  the  incident  beam  is  a  spherical  wave.  A  first- 
order  correction  for  secondary  extinction  would  still  be  possible,  but  in  a  correction  for 
primary  extinction  a  constant  yp  should  be  assumed. 
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Figure  8.  Ewald  construction  for  diffraction  from  a  type-I  crystal  (a)  and  a  type-II  crystal 
(b).  The  broken  lines  in  (a)  show  the  effect  of  variation  of  the  incident  beam  direction.  The 
inserts  illustrate  the  active  volume  of  the  diffraction  domain^  when  Ae  is  the  divergence  of  the 
incident  beam. 


6.2  Incoherence  of  the  beams 

The  basic  idea  of  the  method  is  that  the  d-  and  ^-beams  experience  a  sufficient  number 
of  reflections  as  to  lose  mutual  coherence.  In  the  terminology  introduced  by  Kato 
(1980b),  a  ray  entering  the  crystal  leaves  the  coherent  channel.  The  number  of 
reflections  experienced  by  a  narrow  ray  can  be  estimated  from  figure  4.  The 
approximation  (8)  is  valid  only  when  ^  <  1-5  or  when  the  domains  are  smaller  than 
3 A/2.  In  the  symmetrical  Laue  geometry,  the  area  of  interaction  for  each  ray  is 
2T^  tan  6,  and  so  there  are  about  (T/A)^  tan  6  domains  of  the  maximum  size  in  this  area. 
Simultaneously  reflecting  is  only  the  fraction  Ae/P,  and  with  the  typical  values  quoted 
earlier,  10  to  100  domains  contribute  to  reflection  of  each  ray. 

Although  the  above  result  suggests  that  the  requirements  of  the  theory  are  met 
in  a  typical  measurement,  it  is  worth  studying  what  happens  to  a  ray  that  stays 
in  the  coherent  channel.  This  means  that  the  g-beam  from  the  first  coherent  domain 
is  not  re-reflected,  and  so  the  contribution  to  the  measured  reflecting  ratio  is 
Sr*{e)  =  5r(£)  =  <t(£,  s^,  This  should  not  be  corrected  for  the  effects  of 

secondary  extinction  through  (14),  as  there  are  none,  but  only  for  primary  extinction. 
Accordingly,  there  is  the  possibility  of  over-correction,  which  should  be  eliminated  by 
maximizing  the  number  of  domains  in  the  area  of  interaction,  so  that  the  coherent 
contribution  to  r*{e)  would  be  small.  In  general,  the  smaller  r*(e)  the  smaller  is  the 
correction  for  secondary  extinction  and  also  the  possible  error.  Multiple  peaks  of 
reflections  are  due  to  diffraction  from  independent  coherent  domains,  and  this  kind  of 
data  should  be  treated  with  special  care. 

The  inverse  problem  of  a  crystal  subject  to  secondary  extinction  only  is  trivial.  In  that 
case  fie)  in  (11a)  is  always  unity,  and  therefore  yp(£)  =  1.  However,  the  reflectivity 
curves  <Xi  (e)  and  <T2(£)  of  a  type-II  crystal  should  not  be  compared,  if  these  are  measured 
with  different  wavelengths,  because  the  width  of  R(e)  is  proportional  to  A.  In  that  case, 
only  the  integrals  are  comparable. 
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6.3  Measuring  geometry 

The  application  of  the  method  requires  that  the  incident  beam  has  small  divergences, 
which  do  not  smear  the  reflection  profile.  In  practice,  Ae  must  be  only  about  OT  mrad.  If 
the  crystal  is  rocked  across  the  beam  only  in  one  direction,  the  other  divergence  is 
allowed  to  be  much  larger,  and  the  required  power  of  Pq  =  10'^  to  10*  c/sec  is  easily 
achieved.  In  the  measurements  with  polarized  x-rays  the  plane  of  diffraction  is  rotated 
by  90°  about  the  axis  of  the  beam,  and  the  solid  angle  of  the  incident  beam  should  be 
only  about  10“ ®  sr.  This  requirement  cannot  be  fulfilled  with  a  conventional  source, 
and  in  the  measurement  quoted  earlier  an  algorithm  was  developed  for  solution  of  ypfe). 
This  was  based  on  assumptions  of  the  relation  between  r*(8)  and  yp{e),  which  was 
deduced  from  measurements  with  two  wavelengths. 

Another  practical  limitation  of  the  method  arises  from  the  discrete  set  of  wavelengths 
that  is  available.  The  wavelengths  should  be  optimized  in  regard  of  absorption  and  to 
make  ^  in  (1 1)  and  (12)  sufficiently  different  from  zero.  In  general,  the  use  of  a  polarized 
beam  should  be  favoured,  as  in  this  case  the  change  in  A  does  not  introduce  any  change 
in  the  illuminated  volume  or  other  geometrical  parameters,  and  more  fundamentally: 
scans  in  reciprocal  space  can  be  made  identical.  However,  29  values  of  the  first 
reflections  may  become  too  small  for  a  polarization  measurement,  and  so  2  must  be 
varied. 


7.  Characteristics  of  synchrotron  radiation 

Synchrotron  radiation  of  x-ray  wavelengths  (sxr)  is  produced  in  storage  rings  where 
the  electron  (or  positron)  energy  is  a  few  GeV.  The  spectral  and  angular  distributions  of 
the  radiation  from  various  elements  of  the  ring  differ  greatly,  and  the  present  discussion 
is  limited  to  the  continuous  spectrum  from  a  bending  magnet  or  wiggler.  Figure  9 
shows  a  universal  spectral  curve.  Typically  B  =  1-2  7,  £  =  2to5  GeV,  and  so  order 
of  1  A.  The  photon  flux  is  integrated  over  the  vertical  direction,  where  the  sxr  beam  is 
well  collimated,  and  a  typical  horizontal  aperture  is  order  of  1  mrad.  The  electron  or 


Figure  9.  Universal  spectral  curve  for  synchrotron  radiation  from  a  bending  magnet  of  an 
electron  (positron)  storage  ring.  The  numbers  correspond  to  unit  of  horizontal  divergence 
(mrad),  electron  beam  current  (mA),  and  electron  energy  (GeV).  The  flux  is  integrated  over  the 
vertical  divergence,  and  is  the  characteristic  wavelength,  which  corresponds  to  the  median  of 
the  power  distribution. 
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positron  current  of  the  present-day  rings  is  about  100  mA,  and  so  the  maximum  flux  to 
a  01  %  bandwidth  {taX/X  =  10~^)  is  10^^  to  10^^  photons  in  second.  The  flux  on  the 
sample  depends  on  the  actual  optical  solution  of  the  beam  line,  and  a  schematic 
construction  with  focussing  mirrors  and  a  two-crystal  monochromator  is  shown  in 
figure  10.  The  insert  shows  a  calculated  intensity  distribution  at  the  sample  site,  and  the 
flux  is  about  10^^  photons/sec  with  AA/A  =  10“"^  (Hastings  eta/ 1983).  The  flux  needed 
for  the  present  method  is  far  less,  but  these  estimates  ensure  that  there  is  sufficient  flux 
at  all  the  wavelengths  needed  even  with  very  narrow  collimation  and  good  energy 
resolution. 

The  intensity  and  divergence  distributions  of  the  sxr  beam  can  be  illustrated  by  a 
phase-space  diagram  (figure  1 1).  Each  electron  under  radial  acceleration  emits  a  cone  of 
radiation,  which  is  linearly  polarized  in  the  plane  of  the  electron  orbit  and  elliptically 
outside;  the  figure  shows  the  polarization  components  separated.  The  photon  opening 
angle  depends  on  the  electron  energy  and  the  x-ray  wavelength.  The  parallel  component 
has  almost  a  Gaussian  distribution  with  variance 

a,  =  0-57y-i(A/4)«-^\  (21) 

where  y  =  E/mc^.  Typically  is  order  of  OT  mrad.  The  beam  sweeps  in  the  horizontal 
plane,  so  the  horizontal  distribution  is  uniform,  and  the  divergence  is  determined  by  the 
horizontal  aperture.  The  electrons  move  in  bunches,  which  have  cross-section  of  order 
1  mm^,  and  the  effective  source  is  the  convolution  of  the  radiation  and  electron  beam 
distributions,  as  illustrated  in  figure  11. 


M  1 


M  2 


I 


Figure  10.  Schematic  construction  of  an  sxR  beam  line  with  1 : 1  focussing.  Ml  and  M2  are 
horizontally  focussing  grazing  incidence  mirrors,  and  the  two-crystal  monochromator  is 
designed  to  retain  the  beam  position.  The  distances  z  are  in  mm  and  typical  for  an  x-ray  beam 
line.  The  lower  part  shows  an  intensity  distribution  at  the  sample  site,  as  obtained  from  a  ray¬ 
tracing  calculation  (Hastings  et  al  1983). 


84 


P  Suortti 


The  phase-space  diagram  propagates  down  the  beam  line.  In  the  general  case  it  is 
convoluted  by  the  focussing  functions,  but  figure  12  shows  a  simple  case  which  does  not 
involve  any  curved  crystals  or  mirrors.  In  the  sxr  source  the  position  and  divergences 
are  coupled,  and  so  the  intensity  distributions  at  the  sample  site  are  non-uniform,  in 
contrast  to  those  from  a  conventional  x-ray  source  (c/  figure  7).  A  scan  by  a  crystal  is 
shown  in  the  figure.  A  small  crystal  with  a  very  narrow  reflectivity  curve  can  be  used  as  a 
probe,  but  in  our  case  the  sample  crystal  has  reflection  profiles  that  are  wide  in 
comparison  with  the  divergence  of  the  sxr  beam.  The  polarization  of  the  beam  is  an 
important  parameter  of  the  experimental  situation.  In  a  simple  case,  as  illustrated  in 
figure  12,  the  degree  of  polarization  could  be  calculated  from  the  source  parameters, 
but  generally  it  must  be  measured.  The  results  in  the  figure  show  that  perfect  linear 


Figure  11.  Components  of  the  radiation  distributions  of  one  electron  (left,  =  in  the  orbit 
plane,  i  perpendicular  to  the  plane).  Synchrotron  radiation  source  in  phase-space  (right, 
vertical  distributions).  The  dotted  line  is  the  l<T-contour  for  the  electron  beam.  The  solid  line  is 
the  contour  of  the  parallel  component,  and  the  broken  line  that  of  the  perpendicular 
component. 


Figure  12.  The  phase-space  diagram  at  distance  z  from  the  source;  the  figure  is  obtained 
from  figure  1 1  by  y  -♦  y  -f  y'z.  The  shaded  rectangle  shows  the  effective  area  of  reflection  from  a 
small  crystal,  and  the  arrow  indicates  the  angular  scan.  The  lower  part  shows  the  measured 
average  distributions  of  the  parallel  and  perpendicular  components  across  the  beam. 
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polarization  is  not  found  even  in  the  orbit  plane  unless  a  polarizing  x-ray  optical 
element  such  as  a  Borrmann  crystal  is  inserted  in  the  beam  line  (Materlik  and  Suortti 
1983). 

8.  Outline  of  extinction  measurement 

The  principle  of  the  determination  of  primary  extinction  is  illustrated  in  figure  13. 
According  to  (8),  the  primary  extinction  factor  of  a  coherent  domain  is  ap¬ 
proximately  a  Gaussian  function  of  A~^  The  width  of  the  Gaussian  is  inversely 
proportional  to  the  size  of  the  domain,  and  generally  varies  with  the  rocking  angle  e  of 
the  crystal.  When  a  conventional  x-ray  source  is  used  Ae  may  be  appreciable,  as  in  the 
measurement  with  a  beam  polarized  in  the  plane  of  diffraction,  and  yp(e)  varies  in  a  non¬ 
linear  fashion  within  Ae.  Moreover,  Aj  and  A2  are  determined  by  the  available 
wavelengths,  so  that  the  situation  cannot  be  optimized.  These  restrictions  are  removed 
by  the  use  of  sxr. 

In  conclusion,  a  practical  geometrical  arrangement,  which  is  planned  for  the 
crystallography  beam  line  of  the  nsls  at  Brookhaven  National  Laboratory  is  depicted 
in  figure  14.  The  4-circle  diffractometer  is  built  only  for  a  vertical  scan,  but  an  extra  scan 
in  the  horizontal  plane  is  easily  adapted.  The  beam  is  monochromatized  with  two  Si 
crystals  in  (n,  —  n)  setting,  and  polarized  with  a  Borrmann  crystal  placed  in  front  of  the 
incident  beam  monitor.  The  estimated  flux  on  1  mm^,  when  there  are  no  focussing 
elements  in  the  beam  line,  is  10®  to  10^  c/sec.  This  high  flux  makes  feasible  very  detailed 
studies  of  the  diffraction  properties  of  crystals  with  a  probing  beam  of  size  10~  ^  mm^ 
or  less. 

The  method  can  be  extended  to  crystals  which  are  used  customarily  in  crystallo¬ 
graphy.  Through  careful  measurements  of  the  intensity  of  the  incident  beam,  r*(£)  can 
be  determined  for  a  crystal  bathed  in  the  beam,  and  the  secondary  extinction  factor  is 
found  by  (14).  Kawamura  and  Kato  (1983)  have  given  the  relation  between  <r,  Hq  and  the 
radius  of  a  spherical  crystal  in  an  easily  applicable  form.  Their  calculation  is  for  the 
average  a  and  y^,  but  it  is  valid  also  for  well  collimated  beams,  if  the  conditions 
discussed  in  §  6.1  are  met. 


Figure  13.  Primary  extinction  factor  as  a  function  of  the  rocking  angle  e  of  the  crystal  and 
the  extinction  length  A.  The  domain  size  5  =  5(e)  is  largest  at  e  =  0.  The  divergence  of  the  beam 
Ae  makes  the  measurement  cover  a  corresponding  range  of  yp-values  (thick  lines),  shown  for 
two  values  of  A. 
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Figure  14.  A  modified  4-circle  diffractometer  at  an  SXR  beam  line;  (a)  vertical  scan, 
(b)  horizontal  scan.  B  is  the  Borrmann  polarizer,  the  incident  power  Pg  is  measured  by  an 
ionization  chamber.  The  crystal  is  set  to  the  reflecting  position  by  the  x-,  <A-.  'P-  and  cu-circles, 
scanned  by  co  and  26  in  the  vertical  plane,  or  by  x  and  26'  in  the  horizontal  plane.  The  additions 
to  the  original  construction  (the  and  20' -circles)  are  needed  for  a  horizontal  scan;  20' -circle  is 
indicated  by  broken  lines.  D  is  the  detector  and  M  the  monitor;  the  roles  are  interchanged  in  the 
two  measurements. 
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Mathematical  methods  for  the  analysis  of  charge  densities  from  x-ray 
diiO^ction’^ 


T  N  GURU  ROW 

Physical  Chemistry  Division,  National  Chemical  Laboratory,  Pune  411008,  India 

Abstract.  Accurate  x-ray  and  neutron  studies  have  demonstrated  that  atomic  deformations 
due  to  chemical  bonding  can  be  reliably  mapped.  Several  new  mathematical  techniques  which 
have  been  devised  to  obtain  quantitative  information  from  these  maps  are  discussed  along  with 
their  relative  merits  and  demerits.  Methods  of  obtaining  theoretical  densities  modified  to 
resemble  the  experimental  framework  are  explained.  The  implications  of  these  modified 
models  are  outlined. 

Keywords.  Electron  density  modelling;  theoretical  approaches;  x-ray  diffraction;  charge 
densities. 

1.  Introduction 

In  the  last  two  decades,  with  the  advent  of  automatic  diffractometers  and  high  speed 
computers,  it  has  become  possible  to  collect  and  process  large  amounts  of  accurate 
x-ray  data.  These  accurate  x-ray  intensity  measurements  combined  with  neutron 
diffraction  results  have  demonstrated  that  the  atomic  deformations  due  to  chemical 
bonding  can  be  reliably  mapped  in  molecular  crystals.  The  so-called  X~N,  X-X  (high 
order)  and  other  related  difference  maps  have  revealed  several  qualitative  features  of 
electron  density  distributions.  Since,  x-ray  scattering  is  primarily  due  to  the  interaction 
of  electromagnetic  radiation  with  the  electrons  in  the  crystal,  the  charge  density  is  the 
main  information  which  is  readily  obtained.  Several  new  mathematical  techniques  have 
been  invoked  to  extract  quantitative  information  from  such  charge  density  distri¬ 
butions.  Direct  evaluation  of  one-electron  properties  using  modified  fourier  series 
summations  is  one  such  approach  (Coppens  and  Guru  Row  1978).  Another  popular 
approach  is  the  modelling  of  density  distributions  by  including  additional  deformation 
parameters  in  the  least  squares  refinement.  There  are  several  models  in  practice  and  in 
this  article  salient  features  of  such  models  will  be  discussed. 

Recently,  developments  in  theoretical  methods  have  facilitated  the  calculation  of 
charge  densities  of  sufficient  accuracy  so  that  these  can  be  compared  with  experimental 
charge  distributions.  However,  these  are  limited  for  molecular  crystals  consisting  of 
only  light  atoms.  These  comparisons  would  reveal  the  deficiencies  in  both  theory  and 
experiment.  The  state  of  art  now  (Coppens  1981)  has  reached  a  stage  that  it  is  possible 
to  derive  properties  like  net  charges,  molecular  dipole  moments  and  electrostatic 
potentials  (Stewart  1973,  1979;  Coppens  et  al  1980)  and  to  fit  atom-centred  spherical 
harmonic  functions  (Hirshfeld  1977;  Stewart  1976;  Hansen  and  Coppens  1978).  The 
theoretical  comparisons,  however,  are  to  be  made  with  those  from  a  very  large  basis  set 
(Stevens  et  al  1978;  Stevens  1980)  and  not  with  semi-empirical  or  an  approximate 
minimal  basis  ab  initio  calculations. 
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2.  Density  modelling 


The  choice  of  a  model  is  in  its  efficiency,  simplicity  and  computer  adoptability.  It  should 
also  fit  the  experimental  density  within  the  accuracy  of  the  experiment.  Since,  the  model 
is  to  be  built  into  the  least  squares  procedure  the  adequate  ratio  of  observations  to 
parameters  should  be  maintained  to  get  meaningful  outputs.  The  model  should  be 
consistent  with  theoretical  properties  expected  out  of  the  system.  An  interesting 
possibility  is  to  use  these  conditions  as  constraints  during  refinement. 

The  electron  density,  in  general,  can  be  represented  in  terms  of  a  finite  number  of 
analytical  functions  as 

p  =  Y.p,m,  (1) 

i 


where  Pi  are  the  corresponding  populations.  These  population  parameters  enter  the 
least  squares  formalism  as  refinable  parameters  along  with  the  usual  parameters  like 
scale  factor,  positional  and  thermal  parameters  etc.  However,  the  electron  density 
defined  in  (1)  is  to  be  convoluted  over  the  probability  functions  corresponding  to  the 
thermal  motion  to  represent  the  actual  dynamic  model  density. 


Pmodei  (dynamic)  = 


yi(r-u)Pi(u)du 


i  « 


(2) 


This  formalism  also  filters  out  the  noise  from  the  experiment  because  have  limited 
flexibility  while  calculating  model  deformation  density  as 


^Pmodel  Pmodel  Z  ^isolated,  spherical  atoms  (3) 

The  second  term  in  expression  (3)  is  usually  calculated  from  Hartree-Fock  wave- 
functions. 


2.1  Two-centre  models 

Earlier  models  tried  to  express  electron  density  as  a  sum  over  atomic  orbital  products 
(Stewart  1969;  Jones  et  al  1972). 

Pmodel  =  Z  Pcore  +  Z  ^7  ^  '  (4) 

jk 

These  models,  however,  soon  became  unpopular  because  the  orbital  products  for  an 
extended  basis  set  became  unmanageable  and  large  correlations  between  orbital 
products  hampered  the  refinement  process.  Also,  the  expression  of  thermal  motion  for 
such  a  two-centre  model  becomes  ambiguous.  However,  there  is  a  model  due  to  Helner 
(1977)  which  is  still  a  two-centre  model,  but  expresses  the  density  in  the  bonding  regions 
and  at  lone  pair  sites  by  ellipsoidal  ‘charge  clouds’.  The  structure  factor  expression  then 
becomes 

^model  =  Z  Pcore  ^XP  (  -  B,^^)  eXp  (2711)  S  '  fi 

t 

+  p.  gxp  ( —  BjS^)  exp  (271:08  •  ri.  (5) 

j 

The  shape  and  size  of  each  ‘charge  cloud’  is  expressed  in  terms  of  an  anisotropic  thermal 
parameter. 
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2.2  One-centre  models 

The  simplest  of  these  would  contain  only  spherical  deformation  functions,  the  spherical 
atom  approximation  is  maintained  and  only  the  occupancy  of  the  valence  shell  is 
allowed  to  refine  (Stewart  1970;  Coppens  et  al  1971) 

Pmodel  ^  \-PcoTe  /^valence  (^)l' 

I 

This  is  also  called  the  L-shell  projection  method.  But,  this  model  does  not  allow  for 
changes  in  shape  of  density  distribution  and  thus  contradicts  chemical  bonding  theory. 
A  more  flexible  formalism,  which  has  become  the  most  popular  of  available  models,  is 
the  radial  or  kappa  refinement  (Coppens  et  al  1979).  Here, 

Pmodel  S  CPcore  ^i/^valence  ("^) 


and  it  allows  for  the  refinement  of  P,,  the  population  of  k,,  the  radial  expansion  and 
contraction  of  the  valence  shell.  The  functions  p^ore  Pvaience  the  Hartree-Fock 
densities  of  the  free  atoms  normalized  to  one  electron.  Obviously,  this  model  is  limited 
to  spherical  or  monopolar  deformations.  A  more  general  fit  to  account  for  the 
asphericity  of  atoms  when  they  become  a  part  of  a  molecule  could  be  obtained  through 
a  multipolar  expansion  of  the  valence  density.  General  multipole  models  (Stewart  1976; 
Hirshfeld  1977;  Hansen  and  Coppens  1978)  aim  at  such  a  fuller  description  of  densities 

T^model  Ph  pseudoatom  (8) 

1 

Each  pseudoatom  is  given  as  a  finite  multipole  expansion  of  the  form 

pfr)=Y,Rfr)Ai_^{e,(f>),  (9) 

i,m 

where  r,  9,  4>,  are  the  polar  coordinates  about  nucleus  (,  Rdr)  is  the  radial  distribution 
function  and  T,  „(0,  cj))  is  the  angular  part. 

In  the  formalism  due  to  Stewart  (1976),  the  pseudoatom  density  is  expressed  as  a 
spherical  core  density  together  with  a  multipolar  valence  density  as 

Piir)  =  Pcore  +  Z  PiMr)Y,r„,  W,  4>),  (10) 

/,  m 

Ri{r)  is  the  radial  part  and  is  expressed  in  terms  of  Slater  type  functions, 

i?,(r)  =  Nr"exp(-Cr),  (11) 

with  N  as  the  normalization  constant.  The  angular  functions  are  real  spherical 
harmonics  expressed  in  terms  of  the  associated  Legendre  polynomials 

yim+  {d,  (f>)  =  PT  (cos  0)COS  (t>, 


with 


yim-  {0,  <p)  =  pT  (cos  9)  sin  0, 


Q_Z2)m/2  Am  +  l- 


2'/! 


dZ 


(12) 

(13) 


For  /  =  4,  (i.e.  functions  upto  the  hexadecapole  level)  there  will  be  25  population  and  / 
radial  parameters  per  atom  which  enter  the  least  squares  formalism. 
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In  another  approach  (Hirshfeld  1977),  the  density  is  expressed  as  deviations  from  the 
overall  atom  density  as 


P.(r)  =  Patom('’)+'<5p,-(r),  (14) 

where 

<5p;(r)  =  £ P,kiV,r"  exp  (-ar)  cos"  (15) 

l,k 

6^  is  the  angle  between  r  and  a  specified  vector  K  chosen  to  minimize  overlap  between 
deformation  functions.  This  model  yields  35  refinable  parameters  (/  =  4)  along  with 
one  radial  parameter  per  atom.  Electroneutrality  is  maintained  through  a  constraint. 

Hansen  and  Coppens  (1978)  in  their  formalism  of  a  multipole  model  combine  the 
kappa  model  with  spherical  harmonic  angular  and  Slater  type  radial  functions  for  all 
/  >  1.  This  yields  26  more  refinable  parameters.  The  density  is  defined  as 

Pi{r)  =  Pc  Pcore  ('■)  +  Py  Pvalence  ('^'■)  +  Nr"  exp  (  -  CO  +  (f/O-  ( 1 6) 

Pc,  Py  and  Pin,  are  population  coefficients  such  that 

P^  +  Py+  Pi{n  =  total  number  of  electrons  (17) 

Pgjjjg  and  Pvalence  Hartree-Fock  densities  normalized  to  one  electron.  The  valence 
function  is  allowed  for  k  refinement.  C  is  another  refinable  radial  parameter.  The 
angular  functions  are  derived  in  terms  of  local  atomic  coordinate  systems  which  allows 
for  non-crystallographic  constraints  during  refinement.  Normalization  is  such  that 

[  \y,„+\smdded(t)  =  2.  (18) 

Jo  Jo 

2.3  Merits  and  demerits 

Among  the  one-centre  models,  the  kappa  refinement  formalism  has  become  most 
popular.  Coppens  once  remarked  that  it  is  a  ‘poor  man’s  charge  density’  approach.  This 
methodology  leads  to  reasonable  estimates  of  several  properties  including  net  atomic 
charges  and  dipole  moments.  The  simplicity  of  this  formulation  even  in  case  of 
molecules  containing  heavier  atoms  is  the  main  reason  for  its  popularity.  The  computer 
program  radiel  (Coppens  et  al  1979)  based  on  this  model  has  been  extensively  used. 
The  multipole  models,  however,  are  more  difficult  to  handle  but  produce  similar  results. 
In  general,  this  aspherical  atom  formalism  leads  to  a  significant  reduction  in  the  error 
function  thus  providing  a  better  physically  meaningful  fit  to  the  experimental  data.  The 
various  multipole  models  proposed  lead  to  similar  results;  which  has  been  demonstrated 
recently  (Baert  et  al  1982).  However,  the  molecular  dipole  moments  derived  from 
Hirshfeld  approach  are  generally  high.  This  is  attributed  to  the  presence  of  diffuse 
monopolar  functions  in  the  multipolar  expansion.  A  serious  drawback  comes  from  the 
experiment.  The  limited  number  of  unique  observations,  the  accuracy  and  the 
resolution  places  a  limitation  on  the  density  features  particularly  near  atomic  centres. 
Correlations  between  functions  on  neighbouring  atoms  lead  to  problems  particularly 
in  multipole  expansion.  In  fact  it  is  not  just  the  correlation,  but  the  radial  extent  of  the 
function  itself  (for  /  >  1)  which  may  have  appreciable  value  at  neighbouring  sites.  The 
program  molly  (Hansen  and  Coppens  1978)  which  builds  the  multipolar  expansion 
into  a  regular  least-squares  program  is  widely  used. 
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3.  Theoretical  methods 

The  advances  in  theoretical  chemistry  and  physics  have  led  to  very  sophisticated  model 
development  of  molecular  systems.  The  quality  of  theoretical  methods,  for  molecular 
crystals,  is  good  enough  for  comparison  with  experimental  densities.  Thermal  motion  is 
the  most  important  factor  to  be  accounted  for  before  such  comparisons  are  attempted. 
There  are  two  ways  in  which  this  can  be  done:  (i)  to  obtain  static  theoretical  density  and 
apply  thermal  smearing;  compare  with  experimental  density,  (ii)  to  deconvolute 
thermal  motion  from  experimental  density;  compare  with  static  theoretical  density.  It  is 
not  the  purpose  of  this  article  to  review  the  various  methods  of  static  density 
calculation.  It  is  enough  to  state  that  the  availability  of  experimental  deformation 
densities  could  be  used  as  a  sensitive  test  of  convergence  on  molecular  crystals.  Earlier 
calculations  involving  semi-empirical,  minimal  basis  set  and  scf  type  formalisms  failed 
to  produce  reasonable  deformation  density  in  the  bonding  regions.  These  also  resulted 
in  large  densities  at  lone-pair  regions.  As  bond  density  is  a  phenomenon  akin  to  chemical 
theory  and  is  consistently  observed  in  experimental  maps,  it  can  be  stated  that  the 
experimental  densities  clearly  discriminate  against  less  advanced  calculations  (Coppens 
and  Stevens  1977).  In  a  recent  study  (Stevens  1980)  theoretical  calculations  of  various 
degrees  of  complexities  by  ab  initio  self-consistent  field  method  using  extended  basis  set 
of  Gaussian  orbitals  have  been  performed.  To  facilitate  detailed  comparison  of  these 
theoretical  densities  with  experimental  deformation  density,  a  second  order  difference 
function 


(19) 


^i^P)  ^Pexp  ^Ptheory,  dynamic 


is  defined.  The  thermally-averaged  theoretical  electron  density  <  p  >  can  be  obtained, 
within  the  Born-Oppenheimer  approximation,  as 


where  is  the  nuclear  distribution  function  and  Rj,  R2, .  .  .  Rjv  are  the  nuclear 
position  vectors.  But,  for  molecular  crystals,  since  the  internal  modes  of  vibration  are 
much  smaller  than  the  external  modes  of  vibration  the  dynamic  theoretical  density  can 
be  calculated  to  a  good  approximation  by  convolution  with  rigid  body  thermal  motions 
(Stevens  et  al  1978).  These  values  can  be  readily  computed  from  experiment 
(Schomaker  and  Trueblood  1968).  There  is  one  more  feature  in  molecular  crystals 
which  in  theoretical  density  estimates  cannot  be  fully  accounted  for.  This  is  the 
intermolecular  interactions. 

The  alternative  approach  of  comparing  experimental  densities  with  static  theoretical 
densities  is  after  deconvolution.  However,  it  is  not  yet  clear  as  to  which  of  the  two 
approaches  is  to  be  preferred.  In  general,  the  comparisons  are  to  be  made  only  with  a 
large  basis  set  calculations  and  the  low  level  calculations  are  not  up  to  the  resolution 
reached  by  experimental  densities.  For  example,  in  case  of  oxalic  acid  (Stevens  1980)  ab 
initio  SCF  calculations  were  performed  using  the  program  hondo  (Dupuis  et  al  1976) 
with  an  extended  basis  set  of  Gaussian  orbitals.  An  extended  basis  set  including 
d-functions,  (11,  5, 1/6,  1)  primitive  Guassians  contracted  to  a  <4,  3, 1/4, 1  >  set  was 
used.  Another  calculation  using  4-31  G  basis  with  (8, 4,0/4, 0)  contracted  to 
<  3,  2, 0/2, 0  >  set  was  also  performed.  The  double  difference  maps  calculated  based  on 
expression  (19)  clearly  indicated  that  the  extended  basis  set  generally  is  in  excellent 
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agreement.  The  4-31  G  set  showed  positive  differences  in  the  bonding  regions,  while  in 
lone  pair  regions  the  differences  were  negative  indicating  the  inadequacies  of  the 
theoretical  density. 


4.  Conclusions 

The  various  models  with  their  adoptability  to  a  least-squares  formalism  have  helped  to 
extract  maximum  information  out  of  the  experiment.  However,  caution  is  advised  not 
to  extend  the  refinable  parameters  to  the  extent  of  sacrificing  the  existing  over- 
determinacy  of  the  data  set.  Systematic  errors  and  the  sharp  fourier  cut-off  still  limits 
the  accuracy.  The  deformation  maps  from  the  various  models,  particularly  from  the 
multipolar  expansion  contain  less  noise  than  the  experimental  X-N  or  X-X  (high 
order)  maps.  It  is  generally  advised  to  carefully  examine  the  residual  maps  before 
attempting  to  evaluate  one-electron  properties.  The  fact  that  the  experimental  density  is 
comparable  only  with  the  density  from  a  very  large  basis  set  is  welcome  since  at  this 
level  it  is  possible  to  look  for  very  small  effects  quite  accurately. 
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Determination  of  one-dimensional  crystal  structures  using  the  double 
Patterson  function 


PETER  MAIN 

Physics  Department,  University  of  York,  Heslington,  York  YOl  5DD,  UK 

Abstract.  The  properties  and  previous  use  of  the  double  Patterson  function  in  x-ray 
crystallography  are  briefly  reviewed  followed  by  an  account  of  a  new  iterative  technique,  based 
on  the  double  Patterson,  which  is  being  developed  by  the  author.  The  technique  starts  with  an 
approximation  to  the  double  Patterson  which  does  not  require  phases,  then  improves  the 
approximation  by  making  it  conform  to  the  known  projections  and  known  magnitudes  of  the 
Fourier  coefficients.  The  latter  are  3-phase  structure  invariants  and  estimating  their  phases  is 
an  important  step  in  the  direct  determination  of  structure  factor  phases.  Tests  carried  out  on 
one-dimensional  centro-symmetric  structures  show  that  the  technique  is  successful.  At  best,  it 
determines  correct  phases  for  all  3-phase  invariants.  At  worst,  it  fails  to  improve  on  the 
estimate  of  all  phases  being  zero.  It  consistently  does  very  much  better  than  the  83  o  formula 
which  is  also  based  on  the  double  Patterson.  Further  development  is  necessary  to  apply  the 
technique  to  non-centrosymmetric  structures  and  to  real  structures  in  three  dimensions. 

Keywords.  X-ray  crystallography;  double  Patterson;  crystal  structure  determination;  direct 
method;  image  reconstruction. 


1.  Introduction 


A  project  to  investigate  the  feasibility  of  using  the  double  Patterson  function  to  solve 
crystal  structures  has  recently  been  started  by  the  author.  Some  of  the  interesting  early 
results  are  presented  in  this  paper,  but  as  very  little  use  has  been  made  of  the  double 
Patterson,  it  will  be  helpful  to  start  with  a  brief  review  of  the  properties  and  previous 
use  of  the  function. 

The  Patterson  function  itself  was  introduced  by  Patterson  (1935) 

P(u)  =  I  p(x)p(x-|-u)dE, 

=  -^ZI^(**)|^exp(-27rih-u).  (1) 

The  importance  of  this  function  stems  from  the  fact  that  its  Fourier  coefficients  are  the 
intensities  lF(h)|^  and  it  requires  no  knowledge  of  phases  for  its  calculation.  It  has  an 
appreciable  value  only  when  u  is  an  interatomic  vector  and  so  gives  information  on 
vector  distances  between  atoms  in  the  structure. 

A  generalisation  of  this  function  was  given  by  Sayre  (1953)  who  pointed  out  the 
existence  of  the  double  Patterson  function,  defined  as: 


Z)(u,  v) 


A 

p(x)p(x  -f  u)p(x  -I-  v)d  V 
Jy 


=  Z  Z  E(fi)f  (k)F(  -  h  -  k)exp  [  -  2niih  •  u  +  k  •  v)] 

h  k 


(2) 
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This  is  a  six-dimensional  function  which  has  an  appreciable  value  only  when  u  and  v  are 
interatomic  vectors  referred  to  the  same  atom  as  origin,  i.e.  if  there  are  atoms  at  vector 
displacements  u  and  v  from  any  other  atom  in  the  structure.  It  is  therefore  a  vector  map 
of  the  structure,  but  unlike  the  Patterson  function,  requires  a  knowledge  of  the  phases 
for  its  calculation.  The  Fourier  coefficients  are  the  3-phase  structure  invariants 
F(h)f  (k)F(  — h  — k)  which  clearly  depend  upon  phases  as  well  as  magnitudes.  This  is 
precisely  the  interest  in  the  double  Patterson  function.  Use  of  the  3-phase  structure 
invariants  forms  the  basis  of  most  direct  methods  of  phase  determination  which  are 
now  widely  used  in  crystal  structure  analysis.  It  follows  that  if  an  approximation  to  the 
double  Patterson  can  somehow  be  obtained,  phases  for  the  3-phase  structure  invariants 
may  be  estimated,  thus  making  direct  phase  determination  very  much  easier. 

Sayre’s  generalisation  went  further  than  equation  (2)  indicates.  He  pointed  out  the 
existence  of  the  set  of  density  functions 


D„(ui,U2.  .  .  u„)  =  p(x)p(x-i-Ui)p(x-t-U2) .  . .  p(x-f-u„)dF 


exp[  -  27r/(hi  •  Uj  -I-  hj  •  U2  .  .  .  +  h„  •  uj] 


(3) 


which  can  have  appreciable  values  only  when  Ui,  U2  .  .  .  u„  are  all  inter-atomic  vectors 
referred  to  the  same  atom  as  origin.  These  functions  were  written  down  explicitly  by 
Vaughan  (1958).  Note  that  Z)o(=F(0))  gives  the  contents  of  the  unit  cell, 
corresponds  to  the  Patterson  function  and  D2  is  the  double  Patterson.  The  Fourier 
coefficients  of  all  these  functions  D„  are  structure  invariants  of  order  n  -I- 1 .  This  paper 
considers  only  the  D2  function  which  uses  the  3-phase  invariants. 

2.  Elementary  properties 

Since  there  is  no  single  source  giving  properties  of  the  double  Patterson,  some  of  the 
more  important  and  useful  of  them  are  collected  together  here. 

2.1  Number  and  content  of  peaks 

From  the  definition  of  the  double  Patterson  function  in  (2),  it  is  possible  to  deduce  that 
the  total  number  of  peaks  in  the  function  is  where  N  is  the  number  of  atoms  in  the 
unit  cell.  These  peaks  are  distributed  as  follows: 

(i)  N  peaks  superimposed  at  the  origin. 

(ii)  N{N  —  1)  peaks  in  each  of  the  principal  sections  D(u,  O),  D(0,  v)  and  D(u,  u), 
excluding  the  origin. 

(iii)  N{N  —  l)(iV  —  2)  peaks  elsewhere  in  the  function. 

The  definition  also  shows  that  the  content  of  each  peak  is  proportional  to  the  product 
of  atomic  numbers  of  the  thtee  atoms  giving  rise  to  the  peak. 

From  (iii)  above,  it  is  clear  that  if  iV  ^  2  there  will  be  no  peaks  in  general  positions  in 
the  double  Patterson.  For  the  set  of  density  functions  given  in  (3),  this  may  be  stated 
more  generally  as 


D„(ui,  U2  .  .  .  u„)  =  0,  Ui  . .  .  u„  0,  when  n^N. 


(4) 
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It  is  interesting  to  compare  this  condition  with  that  for  Karle-Hauptman  determinants 
(Karle  and  Hauptman  1950)  whereby  a  determinant  of  order  m  is  identically  zero  if 
m  >  N,  i.e. 

A„  =  0,  when  m>  N.  (5) 

The  lowest  order  determinant  which  fulfils  this  criterion  is  iV  +  1.  Such  a  determinant 
contains  structure  invariants  up  to  order  iV  + 1  in  its  expansion.  This  is  the  same  as 
the  order  of  structure  invariants  in  the  lowest  order  D„  function  which  fulfils  the 
criterion  (4). 


2.2  Symmetry 

The  complete  symmetry  of  the  double  Patterson  function  has  been  described  by 
Vaisberg  (1974),  so  only  the  symmetry  used  later  in  this  paper  will  be  given  here.  The 
double  Patterson  has  the  same  lattice,  in  the  appropriate  three-dimensional  subspaces, 
as  that  of  the  crystal.  It  also  possesses  a  particular  minimum  symmetry  which  is  easily 
worked  out.  If  there  are  two  atoms  at  vector  displacements  u  and  v  from  a  third,  the  two 
atoms  are  related  by  a  vector  displacement  of  u  —  v.  Taking  each  atom  in  turn  as  origin 
gives  rise  to  three  related  peaks  in  the  double  Patterson.  The  resulting  symmetry  can  be 
represented  by 

D(u,  v)  =  D(-v,  u-v)  =  D(v-u,  -u),  (6) 

which  is  a  3-fold  axis.  In  addition,  the  identities  of  u  and  v  are  arbitrary  and  can 
therefore  be  interchanged.  This  gives 

D(u,  v)  =  D{\,  u),  (7) 

which  is  a  mirror  plane.  This  means  £>(u,  v)  has  a  3-fold  axis  and  a  mirror  plane  as 
minimum  symmetry. 

The  equivalent  symmetry  can  also  be  demonstrated  in  reciprocal  space.  The  h,  k 
Fourier  coefficient  of  D{u,  v)  is  given  by  (2)  as  F(h)F(k)F(  — h  —  k).  Clearly  we  have 

F(h)F(k)F(  -  h  -  k)  =  F(k)F(  -  h  -  k)F(h)  =  F(  -  h  -  k)F(h)F(k), 
and 

F(h)F(k)F(-h-k)  =  F(k)F(h)F(-k-h).  (8) 

For  a  one-dimensional  noncentrosymmetric  crystal,  these  results  show  that  the  two- 
dimensional  space  group  of  the  double  Patterson  must  be  p3ml.  Similarly,  for  a 
centrosymmetric  crystal,  the  symmetry  of  the  double  Patterson  will  be  p6mm. 

Since  the  double  Patterson  is  a  vector  map  of  the  structure,  the  (three-dimensional) 
crystal  space  group  translations  do  not  affect  its  symmetry.  Vaisberg  has  shown  that  the 
number  of  possible  symmetry  groups  of  the  double  Patterson  is  73,  corresponding  to 
the  73  symmorphic  three-dimensional  space  groups. 


2.3  Principal  projections 


The  projection  of  the  double  Patterson  function  on  to  a  three-dimensional  crystal 
subspace  is  given  by 


D(u)  = 


D(u,  v)dv 


7 
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^  ^  F(h)F(k)f  (  — h  -k)exp[  — 27z/(h  •  u  +  k  •  v)]dv 

h  k 

^  |F(h)|^  exp(  — 27tj  h  •  u) 

h 


(9) 


which  is  the  Patterson  function  multiplied  by  F(0)  and  therefore  known.  Because  of 
the  symmetry  of  the  double  Patterson,  there  will  be  three  such  identical  projections. 


2.4  Principal  sections 

Consider  now  the  three  identical  principal  sections  D{u,  O),  D{0,  v),  D(u,  u).  An 
expression  for  D{u,  O)  can  be  obtained  directly  from  (2)  as 

D(u,  O)  =  J  p^(x)p(x  +  u)dP, 

=  Z^(*')^(^)^(-h-k)exp(-27rih  ■  u),  (10) 

^  h  k 

where  the  section  is  seen  to  be  the  convolution  of  the  electron  density  inverted  in  the 
origin  with  the  squared  structure. 

An  interesting  result  is  obtained  if  we  assume  the  structure  factors  obey  Sayre’s 
equation  (1952)  so  that 

f(-h)  =  p^Xf(k)f(-h-k),  (11) 

'9k 

where  /  and  g  are  the  scattering  factors  of  the  real  and  squared  atoms  respectively. 
From  (10)  and  (11)  we  obtain 

D(u,  O)  =  ^X^I^(*')rexp(-27iih-u),  (12) 

which  is  a  sharpened  Patterson  function.  The  three  principal  sections  mentioned  above 
can  therefore  be  calculated  if  the  structure  factors  obey  Sayre’s  equation,  i.e.  if  the 
atoms  are  equal  and  resolved. 


3.  Vector  space  methods 

A  small  number  of  people  have  used  the  double  Patterson  in  different  ways  for  crystal 
structure  analysis.  The  methods  used  can  be  classed  as  either  vector  space  or  reciprocal 
space  methods  and  a  representative  selection  of  both  types  are  presented  in  this  and  the 
next  section. 

As  the  double  Patterson  is  a  six-dimensional  function,  it  is  impractical  to  use  it  as  a 
common  means  of  structure  determination.  It  is  only  recently  that  computers  have 
become  sufficiently  powerful  to  contemplate  calculating  it  at  all.  However,  certain 
three-dimensional  sections  have  interesting  properties  which  have  proved  to  be  useful. 
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3.1  Hoppe  section 

Hoppe  (1957)  considered  the  three-dimensional  section 

D(u,  r)  =  J  p(x  +  u)[p(x)p(x  +  r)]dF  (13) 

where  r  is  a  fixed  vector.  If  r  is  a  unique  interatomic  vector,  the  function  p(x)  p(x  +  r) 
will  contain  a  single  peak.  The  convolution  of  this  with  the  electron  density,  as  in  (13), 
will  therefore  give  a  single  image  of  the  structure.  If  such  a  section  can  be  calculated, 
even  approximately,  the  structure  will  be  solved.  On  the  other  hand,  if  r  is  an  n-fold 
interatomic  vector  D(u,  r)  will  contain  n  translated  and  superimposed  images  of  the 
structure  and  will  not  be  easy  to  interpret.  The  situation  becomes  much  simpler  if  r  is  a 
vector  between  two  heavy  atoms.  In  this  case,  the  peak  in  p(x)p(x  +  r)  corresponding  to 
the  superposition  of  the  heavy  atoms  will  be  much  greater  than  all  other  peaks.  This 
results  in  the  corresponding  image  of  the  structure  in  D(u,  r)  being  stronger  than  the 
rest,  allowing  it  to  be  recognised. 

Simonov  and  Vaisberg  (1970)  made  use  of  the  Hoppe  section  in  the  solution  of  the 
synthetic  silicate  Na2Mn2Si2  07  in  the  space  group  Pl^jn.  Since  phases  are  not  known, 
they  used  the  approximation  that  the  phase  of  F(h)f  (k)f  ( —  h  —  k)  is  zero  for  all  h  and 
k.  It  is  well-known  that  this  is  the  most  likely  phase  if  only  the  unit  cell  contents  are 
known  (Cochran  1955).  If  the  magnitudes  of  the  corresponding  normalized  structure 
factors  are  large,  the  approximation  is  good  for  structures  of  this  size.  The  Mn-Mn 
vector  was  easily  identified  in  an  ordinary  Patterson  function,  then  the  appropriate 
Hoppe  section  based  on  this  vector  was  calculated  using  the  approximate  formula 

0(“.  ■•)  =  4  Z  l''('>)l  1 1 1  f  (k)f  ( -  h  -  k)|  exp(  -  2mk  ■  r) 

^  h  Ik 

exp(-27iih  •  u),  (14) 

where  r  is  the  known  Mn-Mn  vector.  Most  of  the  structure  was  recognised  in  the 
resulting  map. 

A  more  usual  method  of  solving  this  structure  would  be  to  use  the  minimum  function 
(Buerger  1953)  Since  a  unique  heavy  atom  vector  is  available.  Simonov  and  Vaisberg 
did,  in  fact,  calculate  the  minimum  function  based  on  the  Mn-Mn  vector  and  obtained 
very  similar  results  to  the  Hoppe  section.  Since  the  Hoppe  section  requires  a  lot  more 
computing,  it  would  appear  to  offer  no  advantage  over  the  minimum  function  under 
these  circumstances. 

3.2  Symmetry  section 

Let  us  assume  the  crystal  space  group  contains  a  pair  of  symmetry  elements  such  that  an 
atom  at  r  is  reproduced  at  Cjr  -I-  dj  and  C2r  -I-  d2,  where  Ci  and  C2  are  point  group 
operators  and  di  and  d2  are  translation  vectors.  Taking  the  point  r  as  origin,  the  vector 
distances  to  the  other  two  atoms  are 

u  =  CiT-l-di -r,  (15) 

and 

V  =  C2r  +  d2  -r. 
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That  is,  a  single  atom  at  r  gives  rise  to  a  peak  at  (u,  v)  given  by  (15)  in  the  double 
Patterson  due  to  this  pair  of  symmetry  elements.  The  double  Patterson  can  therefore  be 
expressed  in  terms  of  r  only  as 

=  4X1  F{h)F{k)Fi  -  h  -  k)  exp { -  2ni[h^{C,  - 1) 

^  h  k 

+  k^(C2-I)]  •  r  +  h  •  dj +k  •  d2}.  (16) 

Such  a  three-dimensional  section  of  the  double  Patterson  is  called  a  symmetry  section 
and  it  contains  peaks  at  atomic  sites.  Symmetry  sections  are  the  equivalent  of  Marker 
sections  in  ordinary  Patterson  functions  and  have  been  studied  by  Biyushkin  (1973). 

As  with  the  calculation  of  the  Hoppe  section,  phases  wilt  not  be  known  but  the 
approximation  that  all  phases  are  zero  may  be  made.  Since  the  peak  heights  are 
proportional  to  Z^,  heavy  atones  should  be  identified  easily  even  with  this  approxima¬ 
tion  to  the  phases.  Biyushkin  and  Belov  (1965)  calculated  such  approximations  to 
symmetry  sections  in  order  to  solve  CoCl(N02)2  •  (NH3)  •  CH2C(NH2)2  in  the  space 
group  P2i  /c.  They  calculated  two  different  two-dimensional  sections  through  the  four- 
dimensional  double  Patterson  of  a  projection  of  the  structure.  The  symmetry  sections 
gave  the  coordinates  of  the  Co  and  Cl  atoms  and  one  light  atom,  from  which  the 
complete  structure  was  obtained  using  weighted  Fourier  syntheses. 

4.  Reciprocal  space  methods 

In  both  examples  of  vector  space  methods,  the  double  Patterson  function  or  its  sections 
were  calculated  under  the  assumption  that  the  phase  of  f  (h)F(k)F(  —  h  —  k)  is  zero  for 
all  h  and  k.  This  is  precisely  the  assumption  made  in  direct  methods  of  phase 
determination  which  are  now  widely  used  in  crystal  structure  analysis.  It  would  seem 
therefore  that  double  Patterson  methods  as  described  in  the  previous  section  add  no 
new  information  to  the  analysis  and  so  cannot  be  made  more  powerful  than  direct 
methods.  In  order  to  make  progress,  better  estimates  for  the  phases  of  the  double 
Patterson  coefficients  must  be  obtained.  This  is  the  purpose  of  the  B3  0  formula  of 
Karle  and  Hauptman  (1958)  which  is  based  on  an  approximation  to  the  double 
Patterson  function  as  shown  by  Vaughan  (1958)  and  will  be  derived  here. 

Consider  the  product  of  three  Patterson  functions  defined  by 

Q(u,v)  =  P(u)P(v)P(u-v).  (17) 

If  u,  V  and  u  —  v  are  all  interatomic  vectors,  Q(u,  v)  will  have  a  large  value.  This  should  be 
compared  with  the  double  Patterson  function  D(u,  v),  which  has  a  large  value  when  u 
and  V  are  interatomic  vectors  referred  to  the  same  atom  as  origin.  Thus  Q(u,  v),  which 
can  be  calculated  without  knowledge  of  phases,  contains  all  the  peaks  present  in  D(u,  v). 
Unfortunately,  it  also  contains  many  more  peaks  not  present  in  the  double  Patterson 
and  it  will  always  be  centrosymmetric.  However,  using  Q(u,  v)  as  an  approximation  to 
D(u,  v)  gives  an  estimate  of  the  Fourier  coefficients  of  the  double  Patterson. 

The  Fourier  transform  of  the  product  of  Pattersons  involves  the  convolution  of  their 
Fourier  coefficients.  Leaving  out  the  origin  peak  and  the  principal  sections  of  Q(u,  v), 
the  h,  k  Fourier  coefficient  is  proportional  to 

<(|£(l)|2-l)(i£(h  +  l)|2-l)(|£(h  +  k  +  l)|2 -!)>,. 


(18) 
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To  put  this  on  the  correct  scale,  we  divide  by  the  average  value  of  (|£(l)|^  —  1)^  and 
multiply  by  the  total  double  Patterson  density  represented  by  (18).  The  latter  is  given  by 
the  total  density  less  that  contained  in  the  three  missing  sections,  giving 


(<Ti^-3(Tia2+2(T3) 


<(|£(l)|^-l)(|£(h  +  l)|^-l)(|£(h  +  k  +  l)|^-l>, 

<(|£(1)|2-1)3>, 


,(19) 


N 

where  a„—  ^  Z". 
j=  1 

The  three  principal  sections  are  restored  by  assuming  they  are  Patterson  functions  as  in 
(12),  giving  the  quantity 


((Ti(T2-(T3) 


(|£(h)|^ 


l)  +  (|£(k)|^-l)  +  (|£(-h-.k)|^-l) 

<|£(l)|^-l>, 


(20) 


where  the  origin  peak  has  also  been  added.  Since  the  function  2(u,  v)  is  centrosym- 
metric,  the  Fourier  coefficient  represented  by  (19)  and  (20)  is  always  real  and  is  given  by 

(72^/" l£(h)£(k)£( - h  - k)|  cos(0(h)  +  (/)(k)  +  </>(-  h  - k)).  (21) 

Remembering  that  <  |£(1)|^  —  1  >,  =  1,  a  combination  of  (19),  (20)  and  (21)  yields  the 
formula 

j£(h)£(k)£(-h-k)|  cos((/)(h)  +  (/)(k)  +  (/)(-h-k))  =  — - 


<(|£(l)|^-l)(|£(h  +  l)|^-l)(|£(h  +  k  +  l)|^-l)>, 

<(|£(I)|2-1)^>, 


(22) 


+  (|£(h)|^  +  |£(k)|^  +  |£(  -  h  -k)|^  -  3)  +  -^ 

(72  O2 

The  £3^0  formula  for  unequal  atoms  is  given  by  Hauptman  (1964)  as 


|£(h)£(k)£(-'h-k)|  cos((/»(h)  +  (/)(k)  +  </)(-h-k))  =  — - ^^^212 

(72 

<(|£(l)|^-l)(|£(h  +  l)|^-l)(|£(h  +  k  +  l)|^-l)>, 

<(i£(l)r-l)^>,  ^ 


+  -%(|£(h)P  +  |£(k)|^  +  |£(-h-k)|^-2), 

(72 

which  differs  from  (22)  in  the  magnitude  of  the  second  term.  Since  this  is  of  the  order  of 
1/N  of  the  first  term,  there  is  little  practical  difference  between  the  two  formulae. 

The  reason  why  this  formula  does  not  reliably  indicate  the  value  of  cos((/)(h)  +  </)(k) 
+  (/)(  — h  —  k))  is  that  Q(u,  v)  is  too  poor  an  approximation  to  D(u,  v).  For  all  but  the 
most  trivial  structures,  most  of  the  peaks  in  Q  (u,  v)  do  not  belong  to  D  (u,  v)  at  all  and  the 
proportion  of  wrong  peaks  increases  with  the  size  of  the  structure.  This  is  seen  in 
figure  1  which  shows  the  double  Patterson  of  a  four-atom  one-dimensional  centrosym- 
metric  structure  compared  with  the  corresponding  Q(u,  v)  function.  Even  for  this 
simple  structure,  a  large  proportion  of  the  peaks  off  the  principal  sections  are  spurious. 
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Figure  1.  The  double  Patterson  function  for  a  one-dimensional  centrosymmetric  structure 
with  two  atoms  in  the  asymmetric  unit.  The  numbers  give  the  peak  heights  and  positions.  The 
dots  indicate  the  positions  of  additional  (spurious)  peaks  which  will  also  appear  in  the  Q(u,  v) 
function. 


The  successful  applications  of  B3  o  have  mainly  been  with  fairly  small  structures  and 
almost  never  in  a  situation  where  the  structure  could  not  be  solved  by  other  means. 

5.  A  new  iterative  technique 

In  the  previous  section  it  was  shown  that  the  83  0  formula  was  based  on  a  very  poor 
approximation  to  the  double  Patterson  function.  One  way  of  improving  the 
performance  of  B3  0  will  be  to  obtain  a  better  approximation  than  the  Q{u,  v)  function. 
Fortunately  this  should  be  possible,  since  not  all  the  available  information  on  the 
double  Patterson  has  been  used  to  set  up  Q(u,  v).  An  obvious  omission  is  that  the 
known  projections  of  the  double  Patterson  have  not  been  used  and,  indeed,  Q(u,  v)  will 
not  give  the  correct  projections  at  all.  It  is  proposed,  therefore,  to  use  ^(u,  v)  as  a  first 
approximation  to  D(u,  v)  and  then  to  modify  it  by  making  it  compatible  with  the  three 
known  principal  projections  described  in  §  2.3. 

Reconstructing  images  from  projections  is  now  a  well-established  process  and  many 
different  techniques  for  doing  it  have  been  developed.  The  ones  worth  considering  for 
the  present  application  are  the  algebraic  reconstruction  technique  (art)  (Gordon  et  al 
1970)  and  the  simultaneous  iterative  reconstruction  technique  (sirt)  (Gilbert  1972)  in 
either  their  additive  or  multiplicative  forms.  Since  the  number  of  known  projections  of 
the  double  Patterson  function  is  very  small  (only  three),  the  problem  of  reconstructing 
the  function  from  them  is  very  underdetermined.  However,  Minerbo  and  Sanderson 
(1977)  have  investigated  the  formation  of  an  image  from  only  two  or  three  projections 
and  they  report  acceptable  results  for  simple  images  when  the  multiplicative  version  of 
ART  is  used.  They  also  show  this  corresponds  to  the  maximum  entropy  solution  in  the 
case  of  three  projections.  A  multiplicative  algorithm  was  therefore  chosen  to 
reconstruct  the  double  Patterson.  Since  it  is  easier  to  maintain  the  symmetry  of  the 
double  Patterson  using  sirt,  the  multiplicative  version  of  sirt  (msirt)  was  used. 

This  is  an  iterative  technique  which  operates  on  the  double  Patterson  function 
evaluated  on  a  grid  of  points  D(u„  Vj).  It  updates  the  value  of  each  grid  point  according 
to 


DP+‘(Ui,  Vj)  =  DP{Ui,yj)SiSjSi+j, 
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where 


Si  =  (TiP(u,)/^DP(u,-,  vj, 


(24) 


k 


and  p  is  the  number  of  the  iteration.  The  effect  is  to  modify  the  double  Patterson  so  its 
three  principal  projections  match  the  known  projections  aiP(u). 

A  small  amount  of  additional  information  may  be  used  if  it  is  assumed  that  the 
principal  sections  of  D(u,\)  are  also  known  as  described  in  §  2.4.  In  this  case  the  sections 
are  left  out  of  the  calculation  of  scale  factors  in  (24)  and  they  are  also  left  unmodified  by 
the  MsiRT  algorithm.  The  scale  factors  are  now  calculated  as 


(25) 


k 


Perfect  reconstruction  of  the  double  Patterson  function  will  not  be  possible  in  a 
single  application  of  msirt  since  the  problem  is  severely  under-determined.  At  most, 
only  a  small  improvement  over  <2  (“»'')  can  be  expected.  However,  still  more 
information  is  available  since  the  magnitudes  of  the  Fourier  coefficients  of  D(u,  v)  are 
all  known.  The  simplest  way  to  make  use  of  this  is  to  transform  the  approximate  D{u,  v) 
into  reciprocal  space,  correct  the  Fourier  coefficients  to  match  the  known  magnitudes 
and  then  to  transform  them  back  again.  Using  the  projections  in  vector  space  and  the 
magnitudes  in  reciprocal  space  should  bring  a  lot  more  information  to  bear  on  the 
double  Patterson  than  is  used  in  the  original  q  formula. 

All  of  this  can  be  brought  together  in  the  following  iterative  procedure  to  obtain  a 
good  estimate  of  the  double  Patterson  function: 

(a)  use  g(u,  v)  defined  in  (17)  as  a  first  approximation  to  D(u,  v); 

(b)  scale  the  density  to  match  the  known  total  density  (total  density  off  principal 
sections  of  D(u,  v)  is  af  —  3a  102  +  2(73)  and  set  all  negative  regions  of  Z)(u,  v)  to 
zero; 

(c)  make  the  approximate  D(u,  v)  compatible  with  the  known  projections  by 
applying  msirt; 

(d)  for  equal-atom  structures,  the  principal  sections  of  D(u,  v)  can  be  set  equal  to  their 
known  values; 

(e)  transform  to  reciprocal  space; 

(f)  alter  the  calculated  magnitudes  | (h,  k) |  such  that  if  |£j^,j,(h,  k)|  >£p(jj(h,  k) 
then  |£cait.(h,  k)|  =£oi35(h,  k).  Other  magnitudes  and  all  phases  are  left  unchanged. 

(g)  other  relationships  among  structure  factors  may  be  used  here  such  as  Sayre’s 
equation  to  refine  phases — see  the  next  section  for  a  description  of  this; 

(h)  transform  back  to  vector  space  and  repeat  from  (b)  until  the  double  Patterson  has 
converged  or  until  reasonable  estimates  of  the  phases  of  the  Fourier  coefficients 
have  been  obtained. 

6.  Results 

The  iterative  procedure  described  in  the  last  section  requires  a  huge  amount  of 
computing,  even  for  small  structures.  However,  if  it  forms  the  basis  of  a  method  of 
solving  structures  that  could  not  otherwise  be  solved,  it  will  be  worth  developing.  To  see 
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how  the  procedure  behaves,  it  was  applied  to  one-dimensional  centrosymmetric  model 
structures.  The  electron  density  was  made  discrete,  i.e.  it  existed  on  a  grid  of  evenly 
spaced  points,  and  it  consisted  of  equal  point  atoms.  This  bears  little  resemblance  to 
real  structures,  but  it  reduced  the  amount  of  computing  to  easily  manageable 
proportions. 

As  might  be  expected,  the  structures  which  were  easy  to  solve  were  those  in  which  the 
electron  density  was  predominantly  zero  with  only  a  small  number  of  atoms.  The  q 
formula  correctly  calculated  the  phases  of  the  strongest  Fourier  coefficients  and  it  was 
unnecessary  to  use  any  better  approximation  to  the  double  Patterson.  Likewise,  those 
structures  which  contained  a  large  number  of  atoms  with  only  a  few  points  of  zero 
electron  density  were  also  easily  solved.  In  this  case,  the  signs  of  the  Fourier  coefficients 
were  predominantly  negative  and  were  correctly  calculated  by  q. 

Between  these  two  extremes,  a  range  of  structures  could  be  set  up  of  various  degrees 
of  difficulty.  Three  examples  are  set  out  in  table  1 .  The  electron  density  is  evaluated  at 
32  points  in  the  unit  cell  and  only  the  asymmetric  unit  is  given  in  the  table.  A  measure  of 
the  progress  of  the  technique  is  given  by  the  weighted  rms  phase  error  of  the  double 
Patterson  Fourier  coefficients  £(h)£(k)£(  — h  —  k),  the  weights  being  the  magnitudes 
of  the  coefficients.  The  very  weakest  coefficients  whose  phases  are  not  of  interest  were 
left  out  of  the  average.  At  iteration  0,  the  rms  error  is  calculated  using  an  estimate  of 
zero  for  all  the  phases,  which  is  the  normal  estimate  in  direct  methods.  Iteration  1 
represents  the  results  of  applying  B^  q,  that  is,  phases  calculated  directly  from  the 
Q(u,  v)  function.  Thereafter  each  iteration  consists  of  correcting  the  magnitudes  in 
reciprocal  space  and  modifying  the  double  Patterson  in  vector  space  as  described  in  the 
previous  section. 

In  the  first  example  B^^q  is  seen  to  provide  very  little  improvement  in  the  phases,  but 
all  phases  are  determined  accurately  after  a  few  iterations  of  improving  the  double 
Patterson.  It  was  found  that  making  use  of  Sayre’s  equation  at  step  (g)  of  the  algorithm 
speeded  up  convergence  enormously,  although  it  was  not  necessary  for  the  eventual 
success  of  the  method.  Sayre’s  equation  in  (11)  can  be  modifed  to  give 

=  (26) 


Table  1.  Three  examples  of  one-dimensional  structures  referred  to  in  the  text.  The  phase 
errors  are  those  of  the  double  Patterson  Fourier  coefficients  (3-phase  invariants)  and  are 
expressed  in  degrees.  Note  that  random  phases  will  produce  an  rms  error  of  127°. 


(a)  asymmetric  unit  of  p{x) 

0 

1 

1 

0 

0 

0 

0 

1 

1 

0 

0 

1 

0 

0 

0 

1 

0 

asymmetric  unit  of  P(u) 

12 

4 

2 

4 

5 

2 

6 

4 

4 

8 

7 

0_ 

2 

6 

5 

4 

6 

iteration  number 

0 

1 

2 

3 

4 

5 

6 

7 

wtd.  rms  phase  error 

109 
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77 

35 

19 

10 

6 

0 

(b)  asymmetric  unit  of  p(x) 

0 

0 

1 

1 

0 

0 

0 

1 

1 

0 

0 

1 

0 

0 

0 

1 

0 

asymmetric  unit  of  P{u) 

12 

4 

1 

2 

7 

6 

5 

2 

4 

6 

7 

2 

2 

6 

7 

4 

2 

iteration  number 

0 

1 

2 

3 

4 

5 

6 

7 

wtd.  rms  phase  error 

113 

122 

52 

36 

33 

34 

35 

37 

(c)  asymmetric  unit  of  p{x) 

0 

0 

1 

1 

0 

0 

1 

0 

1 

0 

0 

1 

0 

0 

0 

1 

0 

asymmetric  unit  of  P{u) 

12 

2 

3 

4 

5 

6 

5 

2 

4 

8 

3 

4 

3 

6 

6 

4 

2 

iteration  number 

0 

1 

2 

3 

4 

5. 

20 

wtd.  rms  phase  error 

113 

128  124 

126  125  121 . 

113 

Structure  determination  using  the  double  Patterson 


105 


in  which  the  right  side  represents  a  projection  in  reciprocal  space  of  the  double 
Patterson  coefficients.  The  left  hand  side  is  independent  of  phase  and  so  is  always  known. 
That  is,  Sayre’s  equation  gives  known  projections  in  reciprocal  space,  corresponding  to 
the  known  sections  in  vector  space.  A  simple,  though  unconventional,  way  of  applying 
Sayre’s  equation  is  to  apply  sirt  to  modify  the  double  Patterson  coefficients  to  give  the 
correct  projections  and  this  is  what  was  done. 

The  second  example  in  table  1  is  a  more  difficult  structure  than  the  first.  It  can  be  seen 
that  the  B3  o  results  give  worse  estimates  of  phases  than  assuming  they  are  all  zero.  The 
double  Patterson  iteration,  on  the  other  hand,  rapidly  improves  the  phases  and 
converges  to  an  rms  error  of  about  35°.  Even  though  this  is  not  a  complete  success,  the 
improvement  in  phase  estimates  is  significant.  As  a  measure  of  this  improvement,  an 
attempt  was  made  to  solve  the  structure  using  conventional  symbolic  addition  with 
estimates  of  zero  for  the  3-phase  invariants  and  then  with  estimates  given  by  the  double 
Patterson  iteration  technique.  In  both  cases,  sixteen  plausible  sets  of  signs  were 
obtained  for  the  ten  largest  £’s  and  the  quantity  ^^£(h)£(k)£(  —  h  —  k)  was  calculated 
for  each  set  as  a  simple  figure  of  merit.  In  the  conventional  calculation,  the  correct  set  of 
signs  gave  the  worst  figure  of  merit.  With  the  double  Patterson  coefficients  as  the  3- 
phase  invariants,  the  correct  set  of  signs  gave  the  best  figure  of  merit.  Clearly,  the 
structure  is  solved  much  more  easily  after  double  Patterson  iteration. 

The  third  example  is  of  a  more  difficult  structure  still.  Here  the  £3  0  formula 
calculates  random  phases  and  the  double  Patterson  iteration  cannot  improve  on  the 
original  estimates  of  all  phases  equal  to  zero. 


7.  Discussion 

In  all  structures  tested,  the  double  Patterson  iteration  technique  did  significantly  better 
than  the  £3  0  formula  in  estimating  the  signs  of  the  3-phase  invariants.  At  worst,  it  was 
as  good  as  assuming  all  signs  were  positive  as  in  the  example  in  table  1  (c)  and  at  best  was 
capable  of  estimating  all  signs  correctly,  even  starting  from  an  rms  error  of  over  100°  as 
in  table  1(a). 

There  are  two  features  of  the  structures  tested  which  determine  their  difficulty.  One  is 
the  number  of  zeros  in  the  Patterson  function.  The  structure  in  table  1  (a)  produces  a 
single  zero  in  the  Patterson  whereas  the  example  in  table  1  (b)  has  no  zeros.  Since  there  is 
a  restriction  that  the  double  Patterson  density  cannot  be  negative,  a  zero  in  the 
Patterson  function  automatically  leads  to  correct  values  at  all  points  in  the  double 
Patterson  contributing  to  that  point  in  the  projection.  Therefore,  the  more  zeros  there 
are  in  the  Patterson  function,  the  easier  the  structure. 

The  second  feature  for  consideration  is  how  flat  the  Patterson  function  is.  Structures 
whose  Patterson  functions  are  fairly  flat  are  more  difficult  to  solve  than  those  with 
prominent  peaks  or  deep  troughs.  The  example  in  table  1(c)  has  a  much  flatter 
Patterson  than  that  in  table  1  (b)  and  it  has  already  been  shown  to  be  a  more  difficult 
structure.  The  variances  of  the  Patterson  density  are  2.7  and  4.3  respectively. 

An  examination  of  the  structures  that  failed  to  solve,  or  were  difficult,  revealed 
another  interesting  fact.  A  large  proportion  of  the  signs  of  the  Zj-type  coefficients,  i.e. 
those  of  the  form  £  (h)  £  (h)  £  ( —  2h),  were  wrong.  The  Z 1  formula  can  often  give  wrong 
results,  but  it  is  disturbing  that  under  perfect  conditions,  without  experimental  error, 
the  approximation  to  the  double  Patterson  used  here  also  estimates  these  signs 
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wrongly.  It  ought  to  be  more  reliable  than  the  Zj  formula  itself.  This  has  led  the  author 
to  abandon  the  Ij  formula  as  an  indicator  of  phases  prior  to  phase  determination  by 
the  tangent  formula  of  Karle  and  Hauptman  (1956). 

It  is  clear  from  these  tests  that  information  about  3-phase  invariants  can  be  obtained 
by  objective  procedures  involving  the  double  Patterson  function.  It  is  interesting  that 
no  use  is  made  of  probability  relationships  as  in  conventional  direct  methods;  neither  is 
Sayre’s  equation  necessary,  although  it  is  useful  to  accelerate  the  convergence  of  the 
procedure.  Unfortunately,  it  is  impossible  to  predict  how  the  technique  will  behave 
with  real  crystal  structures  in  three  dimensions.  More  elaborate  testing  will  have  to  be 
done  to  determine  this. 

A  problem  not  yet  solved  is  how  to  define  the  enantiomorph  for  non- 
centrosymmetric  structures.  It  is  possible  that  the  determination  of  cos(<^(h)-l-(/)(k) 
+  </)(  —  h  —  k))  may  be  sufficient,  but  it  would  be  more  satisfactory  if  the  actual  phases 
could  be  obtained. 

The  larger  the  structure,  the  more  under-determined  is  the  image  reconstruction  from 
projections  in  vector  space.  This  may  well  be  the  limiting  factor  in  the  whole  procedure, 
although  where  the  limit  lies  is  completely  unknown  as  yet. 

The  computing  time  will  be  large  for  real  structures,  but  the  fastest  computers  should 
now  be  able  to  handle  a  realistic  calculation.  This  will  limit  the  method  to  those  crystal 
structures  that  cannot  otherwise  be  solved,  or  require  several  man-years  work  as  in 
macromolecular  crystallography.  Even  if  the  method  proves  to  be  of  no  practical  use, 
the  results  presented  in  this  paper  are  very  interesting  in  themselves. 
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Precision  of  determination  of  d-spacings  using  a  Guinier  camera 


SVEND  ERIK  RASMUSSEN 

Chemistry  Department,  Aarhus  University,  DK-8000  Aarhus  C,  Denmark 

Abstract.  The  precision  of  determination  of  lattice  spacings  using  a  commercial  triple  Guinier 
camera  and  a  precision  comparator  is  evaluated.  Determination  of  lattice  constants  on 
germanium,  tungsten  and  various  samples  of  mullite  are  given.  It  is  found  that  d-spacings  can 
be  routinely  determined  with  uncertainties  of  the  order  of  1  part  in  10000  in  the  analytically 
most  useful  Bragg  angle  range  from  0  to  45°  in  theta. 

Keywords.  Precision;  d-spacings;  Guinier  camera. 


1.  Introduction 

For  a  perfect  camera  the  uncertainty  in  d-spacing  measurements  is  given  by  the 
expression:  Ati/ti  =  cot  dA6,  assuming  that  the  wavelength  of  the  x-ray  beam  is  either 
known  accurately  or  is  considered  a  reference  value.  For  cubic  crystals  the  relative 
uncertainty  of  the  lattice  constant  Aa/a  likewise  varies  as  cot  6A6.  It  has  become 
standard  practice  to  determine  lattice  constants  for  cubic  and  other  crystals  using 
extrapolation  methods  as  the  uncertainty  of  a  d-spacing  measurement  should  vanish  at 
6  =  90°.  Details  on  determination  of  lattice  constants  using  powder  measurements,  and 
references  are  given  by  Klug  and  Alexander  (1974). 

The  standard  Guinier  camera  employing  transmission  geometry  normally  limits  the 
angular  range  to  =  45°.  Extrapolation  methods  are  of  little  use  in  this  case. 
Precision  determinations  require  very  accurate  angle  measurements.  Since  a  good 
Guinier  camera  gives  very  sharp  lines  when  a  suitable  sample  is  employed,  it  should  be 
possible  to  obtain  accurate  d-spacing  when  using  Guinier  techniques.  Most  powder 
lines  which  are  suitable  for  identification  purposes  are  located  in  the  “Guinier  range”, 
and  it  is  of  practical  importance,  especially  for  analytical  purposes,  to  examine  the 
accuracy  obtainable  with  the  Guinier  camera,  especially  as  it  is  of  much  lower  cost  than 
a  powder  diffractometer  and  as  it  has  a  better  resolving  power  than  a  standard 
diffractometer. 

Using  a  Guinier  camera  the  Bragg  angle  6  is  determined  by  measuring  the  distance  / 
between  the  line  from  the  primary  beam  and  the  powder  line  from  a  given  d-spacing. 
For  a  camera  of  diameter  114-592  mm  the  nominal  0-value  is  found  as  ^(degrees) 
=  /(mm)/4  and  A0(degrees)  should  accordingly  be  given  as  A/(mm)/4.  Film  shrinkage 
and  other  sources  of  error,  such  as  tack  of  reproducibility  of  sample  to  film  distance, 
require  calibration  of  the  film,  e.g.  by  using  a  reference  sample  with  accurately  known 
lattice  constants  for  accurate  determinations  of  d-spacings. 

We  have  been  using  a  comparator  yielding  nominal  uncertainty  of  0-2  gm  in 
measuring  /-values.  This  uncertainty  is  far  below  other  uncertainties  connected  with  d- 
spacing  measurements,  and  we  assume  therefore  that  the  reproducibility  of  our  d- 
spacing  measurements  reflect  the  sum  of  other  errors  connected  with  the  methods 
applied  such  as  imperfection  in  camera  construction  and  deviations  from  linearity  in 
film  shrinkage  and  the  like. 
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2.  Experimental 

We  use  an  Enraf-Nonius  FR  552  Guinier  camera  with  a  Philips  fine-focus  (04  x  8  mm) 
x-ray  tube.  The  camera  has  three  compartments  allowing  simultaneous  exposure  of 
three  samples.  Pure  CuKaj  radiation  is  used  in  all  three  compartments. 

The  comparator  was  made  by  Zeiss,  Jena,  and  was  modified  according  to  Tomkins 
and  Fred  (1951)  and  Bennet  and  Koehler  (1959).  Line  positions  can  be  read  with  a 
nominal  uncertainty  of  0-2  /im  and  peak  intensities  can  be  determined  over  a  range 
from  1  to  100.  Our  samples  have  been  exposed  at  22-24°C.  No  strict  temperature 
control  has  been  kept.  Thermal  expansion  coefficients  of  many  simple  inorganic 
compounds  are  of  the  order  of  magnitude  of  10”  ^/°C.  An  indeterminacy  of  1-2°  would 
allow  ^.djd  to  be  approximately  10”“^  for  most  “hard”  substances. 

In  the  past,  we  have  used  one  compartment  in  our  triple  Guinier  camera  for  the 
reference  sample  and  corrected  0-values  in  the  two  other  compartments  using  the 
calibration  values  obtained  from  our  standard.  Flowever,  when  the  comparator  became 
available  to  us,  we  found  that  this  procedure  is  not  valid  when  the  precision  of 
measuring  /  is  better  than  OT  mm.  Therefore  we  now  mix  our  reference  material  with 
the  sample  when  high  accuracy  is  required.  Normally,  we  use  semiconductor  grade 
silicon,  and  when  silicon  is  not  well  suited  we  use  germanium.  Unfortunately, 
germanium  powder  is  quickly  contaminated  by  Ge02  and  only  freshly  powdered 
germanium  is  a  suitable  reference  material.  Although  we  can  normally  reproduce 
readings  to  the  nominal  uncertainty  of  0-2  ^m  when  repeating  measurements  of  a  given 
line  within  a  short  time  scale  of  a  few  minutes,  the  reproducibility  of  measuring  lines  on 
a  given  film  at  different  days  is  about  0  005-0  01  mm.  Corrections  from  different 
exposures  can  vary  as  much  as  OT  mm. 

For  comparison  we  have  also  used  a  standard  Bragg-Brentano  type  powder 
diffractometer  equipped  with  a  diffracted  beam  graphite  monochromator  and  a  strip 
chart  recorder.  We  use  Co-radiation  with  the  diffractometer  and  both  the  aj  and  the  a  2 
components  are  registered.  The  diffractometer  allows  angles  to  be  measured  up  to  1 58°. 

3.  Discussion 

The  corrections  A0  can  be  reproduced  within  an  uncertainty  of  about  0  01  mm  by  a 
polynomium  of  second  degree.  We  have  tried  two  expressions; 

a  -  d  +  b 

A6  =  a  +  bd  +  cd^  and  A0  =  - - ~ 

1  +c6 

and  determined  a,  b  and  c  by  ordinary  least-squares  methods  using  a  microcomputer. 

In  principle,  A0  should  vanish  at  0  =  0.  Experience  shows,  however,  that  it  is 
advantageous  to  include  a  constant  term  in  the  expressions.  In  most  cases  there  is  little 
difference  between  the  applicability  of  the  two  expressions.  Some  care  must,  however, 
be  exercised  when  using  the  hyperbolic  expression  as  “c”  is  normally  negative  and  a 
singularity  sometimes  occurs. 

The  validity  of  our  approach  has  been  checked  by  measurements  on  high  purity 
samples  of  germanium  and  of  tungsten  using  semi-conductor  grade  silicon  as  reference 
sample.  Lattice  constants  were  calculated  both  by  using  {tgO)^  as  weights  and  by  using 
unit  weights.  No  significant  difference  was  found,  although  the  weights  differ  with  a 
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factor  of  10  when  the  {tg&f  weighting  scheme  is  used. 

For  comparison  the  lattice  constants  were  determined  also  from  powder  dif¬ 
fractometer  measurements  using  an  instrument  allowing  measurements  up  to  1 58°  in  2- 
theta.  Peak  positions  were  read  from  strip  chart  recordings.  Also  in  this  case  we  used 
silicon  as  an  internal  standard. 

The  lattice  constants  were  also  in  this  case  calculated  using  both  [tgOf  weighted  and 
unweighted  data.  The  range  of  weights  in  this  case  was  from  1-120  (W)  and  1-28  (Ge). 
The  {tgdf  weighting  scheme  has  some  similarity  with  an  extrapolation  procedure.  The 
lattice  constants  for  Ge  and  W  determined  by  both  methods  are  given  in  table  1.  There 
are  no  significant  differences  between  the  lattice  constants  obtained  by  the  two 
methods. 

As  a  further  check,  we  tried  to  distinguish  between  two  samples  of  mullite  of  slightly 
different  compositions.  Mullite  is  a  crystalline  phase  composed  of  AI2O3  and  Si02.  It 
exists  in  the  interval  60-67  mole%  AI2O3.  The  literature  gives  conflicting  information 
on  the  melting  of  mullite.  Some  authors  claim  that  the  melting  is  congruent,  others  that 
it  melts  incongruently.  References  on  such  observations  and  a  phase  diagram  of  AI2O3, 
Si02  are  given  by  Aramaki  and  Roy  (1959). 

Neuhaus  and  Richartz  (1958)  have  grown  single  crystals  of  mullite  of  composition 
2AI2O3,  Si02  by  the  Czochralski  method.  This  supports  the  assumption  of  congruent 
melting.  We  have  also  at  Aarhus  succeeded  in  growing  transparent  mullite  crystals  of 
2 : 1  composition  by  Czochralski  growth  using  an  iridium  crucible  to  contain  the  melt. 

The  quality  of  these  crystals  was  assessed  by  neutron  and  x-ray  diffraction.  We  were 
not,  however,  able  to  grow  single  crystals  from  melts  of  composition  3AI2O3,  2Si02. 
These  experiments  gave  invariably  a  polycrystalline  mass.  We  examined  the  differences 
in  lattice  constants  between  2 : 1  and  3 : 2  mullites  by  powder  diffraction  using  both  the 
Guinier  camera  and  the  powder  diffractometer.  In  both  types  of  measurements  silicon 
was  added  as  internal  standard. 

Precision  measurements  of  Guinier  diagrams  of  the  two  samples  showed  small  but 
significant  differences.  All  <i-spacings  of  the  2 : 1  sample  were  larger  than  the 
corresponding  ones  from  the  3 ;  2  sample.  The  differences  varied  from  0  01 76  A  for  the 
largest  d-spacing  measured  (541  A)  to  00011  A  for  the  smallest  spacing  (1-24  A). 
Expressed  in  theta-angles  the  difference  varied  from  0  04  to  OT  degrees. 

The  powder  diagrams  were  indexed  preliminarily  using  lattice  constants  obtained 


Table  1.  Lattice  constants  of  Ge  and  W  by  Guinier  and  diffractometer  methods  using  Si 
as  reference  material.  CuKai  =  1-540598.4,  CoKoti  =  1-78890 A.  Lattice  constant,  a,  of 
Si  =  5-43083  A. 


Method 

No.  of 
lines 

Radiation 

Weighting 

scheme 

Sample 

a  (A) 

cr„  A 

Guinier 

5 

CuKa; 

{tgor 

Ge 

5-65844 

6-39  X  10““^ 

Guinier 

5 

CuKiXi 

unit  weights 

Ge 

5-65859 

1-03  X  10“^ 

Diffract. 

7 

CuKai 

{tgey 

Ge 

5-65700 

7-31  X  10'* 

Diffract. 

7 

CuKai 

unit  weights 

Ge 

5-65661 

1-31  X  10“^ 

Guinier 

3 

CuKaj 

(tgoy 

W 

3-1651 

3-57  X  10'^ 

Guinier 

3 

CuKa, 

unit  weights 

W 

3-1652 

4-62  X  10'^ 

Diffract. 

6 

CoKai 

(tgof 

W 

3-16459 

1-58  X  10‘^ 

Diffract. 

6 

CoKa, 

unit  weights 

W 

3-16454 

2-23  X  10"^ 
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Table  2.  Lattice  constants  of  two  mullite  samples.  One  of  composition  2AI2O3,  ISiOj  (2 ;  1 
mullite)  and  another  of  composition  3AI2O3,  2Si02  (3:2  mullite).  Silicon  was  used  as 
reference  sample.  Values  of  constants  employed  are  given  in  table  1. 


Method 

No.  of 
lines 

Radiation 

Weighting 

scheme 

Sample 

«(A) 
(T„(X  10 

b(A) 

^r^fxlO-^) 

c(A) 

(7,(x  lO--") 

Guinier 

35 

CuKa, 

(tgey 

2 : 1  mullite 

7-5927 

1-86 

7-6840 

1.78 

2-8882 

3-56 

Guinier 

35 

CuKai 

unit  weights 

2 : 1  mullite 

7-5924 

1-88 

7-6841 

1-86 

2-8882 

3-86 

Guinier 

37 

CuKai 

itger 

3 : 2  mullite 

7-5748 

1-75 

7-6836 

1-94 

2- 8857 

3- 43 

Guinier 

37 

CuKai 

unit  weights 

3 ;  2  mullite 

7-5750 

1-76 

7-6838 

1-92 

2- 8857 

3- 62 

Diffract. 

19 

CoKa, 

unit  weights 

2 : 1  mullite 

7-5852 

1-8 

7-6831 

2-5 

2-8878 

7 

Diffract. 

16 

CoKa, 

unit  weights 

3 ;  2  mullite 

7-5745 

2-15 

7-6839 

1-5 

2-8855 

7 

from  measurements  on  a  Picker  four-circle  diffractometer  on  a  single  crystal.  Thirty 
seven  reflections  could  be  indexed  unambiguously  and  these  were  used  in  a  least- 
squares  analysis  for  determinations  of  lattice  constants.  Standard  deviations  of  the 
Guinier  data  are  smaller  than  those  from  the  diffractometer  data.  The  data  are  given  in 
table  2. 


4.  Conclusion 

It  is  possible  to  determine  d-spacings  of  high  accuracy,  Ad/d  x  10“'^  in  the  analytically 
most  important  angular  range  using  a  commercially  available  Guinier  camera, 
provided  that  the  powder  lines  are  measured  with  a  precision  corresponding  to  an 
uncertainty  of  9  of  about  0002-0005°.  Although  our  comparator  is  hand-operated,  the 
measuring  process  is  not  unduly  time-consuming.  Automated  equipment  does  exist  and 
Edmonds  and  Henslee  (1978)  report  on  the  use  of  microcomputer-controlled  film 
densitometry  applied  to  Guinier  films.  Their  results  are  comparable  with  ours. 
Computing  facilities  need  not  be  very  sophisticated  for  handling  computations  like 
evaluating  measurements,  indexing  lines  and  making  least-squares  determinations  of 
lattice  constants.  We  employ  a  microcomputer  with  48  K  bytes  memory  and  two  floppy 
disk  drives,  but  also  a  smaller  computer  could  do  most  of  the  computations. 
Search/match  procedures  are  often  helped  by  the  high  precision  of  the  data  and  the  cost 
of  a  Guinier  camera  -|-  comparator  -f  microcomputer  is  lower  than  that  of  a  powder 
diffractometer  of  similar  capacity. 
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The  phase  problem  in  neutron  diffraction 


S  K  SIKKA 

Neutron  Physics  Division,  Bhabha  Atomic  Research  Centre,  Trombay,  Bombay  400085,  India 

Abstract.  The  structure  of  any  crystal  can  now  be  determined  ab  initio  from  the  neutron  Bragg 
intensity  data  alone,  without  recourse  to  the  x-ray  structural  analysis.  This  has  been  made 
possible  by  the  reduction  in  data  collection  times  due  to  the  availability  of  increased  neutron 
fluxes  at  the  samples  and  the  extensive  development  of  the  phase  determining  procedures  for 
neutron  diffraction  in  the  last  15  years.  In  this  review,  we  describe  the  applications  of  direct 
methods,  anomalous  dispersion  techniques  and  difference  Patterson  methods  and  discuss  why 
these  methods  are  applicable  in  neutron  diffraction.  Their  limitations  are  also  discussed.  Some 
newer  methods  like  resonance-modulated  diffraction  and  use  of ‘Renninger  effect’  to  measure 
the  structure  invariants  are  also  touched  upon. 

Keywords.  Neutron  diffraction;  Patterson  method;  anomalous  dispersion  techniques;  direct 
methods. 


1.  Introduction 


Neutron  diffraction  has  distinct  advantages  in  structural  analysis  of  crystals:  (i)  to 
locate  hydrogen  atoms,  thereby  providing  in  hydrogen-bonded  crystals  a  very  precise 
stereochemical  information  about  hydrogen  bonds  and  (ii)  to  differentiate  the 
neighbouring  elements  in  the  periodic  table,  say,  nitrogen  from  carbon  and  oxygen,  in 
crystals  of  organic  and  biological  molecules.  Lately  the  determination  of  deformation 
charge  densities  in  organic  molecules  by  combined  x-ray  and  neutron  studies  has  also 
become  possible.  For  a  recent  review  of  the  above  applications  of  neutron  diffraction 
see  Chidambaram  (1981). 

Till  late  sixties,  the  neutron  diffraction  of  a  given  crystal  was  studied  only  after 
analysis  of  its  structure  by  x-rays.  Once  the  x-ray  non-hydrogen  skeleton  of  the  structure 
was  available,  the  phase  problem  really  did  not  exist  in  neutron  diffraction.  This  is 
because,  from  neutron  scattering  length  considerations  (see  table  1)  a  typical  organic 
crystal  containing  C,  H,  N,  Cl  and  O  atoms  etc  may  be  assigned  a  formula,  Cp  •  Then 
the  average  contribution  of  non-hydrogen  atoms  to  neutron  intensities 


P(0-66)^ 

P(0-66)2-bQ  (0-372)2 


(1) 


for  P  =  Q  case  is  0-7.  For  this  o^,  92%  of  phases  evaluated  from  non-hydrogen  atom 
positions  only,  according  to  the  theory  of  Ramachandran  and  Parthasarathy  (1965), 
should  be  within  90°  of  the  true  phases,  a  criterion  which  usually  leads  to  the  final 
structure.  In  an  actual  case,  carbon  monoxide  derivative  of  myoglobin  (Norvell  et  al 
1975),  the  true  and  the  calculated  phases  with  omission  of  hydrogen  atoms  differed  on 
the  average  by  31°  up  to  1-8  A  Bragg  resolution. 

Two  reasons  can  be  attributed  for  not  carrying  out  neutron  structure  determinations 
ab  initio.  First,  because  of  low  incident  neutron  fluxes  from  nuclear  reactors 
(~  lO'^  neutrons/cm^/sec  at  the  sample)  in  earlier  times,  the  neutron  Bragg  intensity 
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Table  1.  Neutron  scattering  amplitudes  for  some  elements. 


z 

Element 

6(10- cm) 

1 

H 

-0-372 

b% 

1-04 

6- 

-407 

1 

D 

0-67 

6 

C 

0-6626 

7 

N 

0-940 

8 

0 

0-58 

17 

Cl 

0-98 

48 

‘”Cd 

0-725 -f4-507( 

(at  A  =  0-678  A) 

62 

'“’Sm 

0-795  +  6-051/ 

(at  A  =  0-915  A) 

66 

Dy 

1-69 

•S^Dy 

4-94 

92 

U 

0-85 

*b+  for  /  +  j  and  for  /— 5  compound 
nucleus  states. 


data  collection  was  very  time-consuming  compared  to  x-rays.  Second,  because  of  the 
differences  in  which  x-rays  and  neutrons  interact  with  the  atoms  making  up  the  crystals, 
it  was  suspected  that  the  customary  phasing  methods  of  x-rays  like  the  heavy  atom 
technique,  direct  methods  etc  could  not  be  employed  in  neutron  diffraction.  However, 
with  the  advent  of  high  flux  reactors  (~  10^-10^  neutrons/cm^/sec  at  the  sample)  and 
building  of  automatic  four-circle  diffractometers,  the  neutron  Bragg  intensity  data 
collection  times  have  been  drastically  reduced  and  full  three-dimensional  data  are  now 
routinely  recorded  (see  Sequeira  et  al  1978).  These  have  motivated  many  attempts  to 
solve  the  crystal  structures  directly  from  neutron  data.  These  have  been  quite  successful 
and  have  shown  that  the  direct  methods  and  anomalous  dispersion  techniques  of  x-rays 
can  be  applied  in  neutron  diffraction  as  well.  Since  this  volume  is  dedicated  to  Prof. 
Ramaseshan,  it  is  appropriate  here  to  review  these  successes,  as  he  was  the  first  to 
recognise  the  higher  potential  of  neutron  resonance  scattering  compared  to  the  x-ray 
anomalous  dispersion  scattering  for  solving  the  phase  problem.  Before  we  do  so,  it  is 
instructive  first  to  examine  the  causes  of  the  differences  in  x-ray  and  neutron  phase 
problems  in  some  detail. 

2.  Differences  between  x-ray  and  neutron  phase  problems 

The  differences  arise  from  the  fact  that  the  scattering  agents  in  atoms  are  different  for 
the  two  radiations.  Whereas  the  x-rays  are  essentially  scattered  by  atomic  electrons,  the 
scattering  centres  for  neutrons  are  predominantly  the  nuclei.  The  nuclear  scattering 
length  for  neutrons  can  be  written  as 

b  =  bo  +  b'  +  ib".  (2) 

The  term  b^  represents  the  hard  sphere  or  potential  scattering  contribution  and  is 
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independent  of  neutron  energy,  b'  and  b"  are  the  energy  dependent  resonance  scattering 
contributions.  In  most  cases,  b"  is  negligible.  It  is  this  energy-dependent  b'  term  which 
makes  the  neutron  scattering  amplitude  an  irregular  but  featureful  function  of  the 
atomic  number. 

Expression  (2)  is  similar  to 

f  =  fo  +  Af'  +  iAr  (3) 

the  x-ray  scattering  factor  of  the  atom.  A/'  and  A/"  are  significant  only  when  the 
wavelength  of  the  radiation  is  close  to  the  absorption  edge  of  this  atom. 

Neutron  scattering  amplitudes  for  some  elements  are  given  in  table  1,  from  which 
many  things  are  apparent. 

2.1  Equal  atoms 

The  neutron  scattering  factors  for  different  nuclei  are  of  the  same  order  of  magnitude 
and  do  not  display  as  much  variation  as  in  the  x-ray  case,  for  which  it  changes  by  a 
factor  of  90  from  the  hydrogen  to  the  uranium  atom.  This,  although  conferring  the 
major  advantage  to  neutron  diffraction  for  location  of  protons,  leads  to  the  fact  that 
none  of  the  nuclei  present  in  the  crystal  will  dominate  the  scattering  of  neutrons  and 
hence  there  will  be  no  heavy  atoms  for  neutron  diffraction.  The  variation  in  h’s  for 
different  elements  is  such  that  at  the  most,  the  neutron  ‘heavy  atoms’  are  of  strength 
equivalent  to  that  of  a  Cl  atom  in  a  crystal  containing  C,  H,  O  atoms  etc.  for  the  x-ray 
case*t.  Further,  as  the  scattering  amplitude  of  the  hydrogen  nucleus  is  comparable  to 
other  elements,  there  are  more  numbers  of  atoms  to  be  located  at  the  phasing  stage  in  a 
neutron  structure  analysis.  Consequently,  the  neutron  Patterson  map  will  be  more 
overcrowded  and  difficult  to  interpret  directly. 

2.2  Negative  scattering  factors 

The  x-ray  scattering  factors  are  all  positive  while  for  neutrons,  they  are  of  both  signs. 
This  fact  may  be  expected  to  have  the  following  consequences: 

(a)  The  neutron  Patterson  map  will  now  contain  both  positive  and  negative  vector 
peaks  and  hence  some  of  them  will  be  removed  from  the  map  by  their  overlap,  thereby 
making  difficult  the  deconvolution  of  the  Patterson  function  with  the  help  of  super¬ 
position  methods  like  the  minimum  and  sum  functions  etc.  However  for  small 
structures,  this  has  not  proved  a  serious  drawback.  The  structures  of  potassium 
hydrogen  chloromaleate  (Ellison  and  Levy  1965)  and  BeS04  ■  4H2O  (Sikka  and 
Chidambaram  1969)  have  been  solved  by  the  combination  of  symmetry  minimum  and 
minimum  functions. 

(b)  The  negativity  of  b  also  appears  to  rule  out  the  possibility  of  using  direct  methods 
in  neutron  diffraction  as  hydrogen  nucleus  {b  is  negative)  are  present  in  almost  every 
interesting  crystal.  This  led  Karle  (1966)  to  propose  the  squared  structure  approach  to 


*  ‘*^Cd,  etc.  can  act  as  heavy  atoms  for  structures,  say  of  200  atoms  per  one  heavy  atom  in  the 

unit  cell.  However,  they  are  of  wrong  type.  They  cannot  be  easily  introduced  into  such  structures  and  also 
their  high  absorption  cross-sections  for  neutrons  pose  problems  in  data  collection, 
tin  some  of  the  structures  e.g.  U03(N03)2  •  6H2O  (Taylor  and  Muller  (1965)  and  HAUCI4  2H2O 
(Williams  and  Peterson  1969)  which  have  been  solved  by  the  ‘heavy  atom’  neutron  Patterson  method,  this 
situation  prevailed. 
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circumvent  the  difficulty  caused  by  the  negative  scattering  factors.  He  showed  that  from 
the  observed  neutron  normalized  structure  factors  (£;,),  it  is  possible  to  derive  another 
set  of  normalized  structure  factors  ( V^)  corresponding  to  a  structure  in  which  all  the 
atoms  scatter  with  the  square  of  the  original  scattering  factors  but  whose  positions  are 
the  same  as  in  the  original  structure.  The  scattering  density  would  then  become  positive 
everywhere  and  the  phase  problem  for  this  ‘squared’  crystal  can  be  solved  using  the 
direct  methods.  The  success  of  this  approach  was  demonstrated  by  Ellison  and  Levy 
(1967),  who  used  it  to  solve  the  structure  of  glycolic  acid. 

Sikka  (1969, 1970),  however,  re-examined  the  consequences  of  the  negativity  of  b  for 
hydrogen  atoms  and  found  that  these  are  not  as  serious,  as  assumed  for  the  straight 
employment  of  direct  methods  in  neutron  diffraction.  The  amount  of  average  neutron 
scattering  from  hydrogen  atoms  present  in  a  crystal  defined  by 

--tbjlibj  (4) 

j=l  I  7=1 

is  rarely  more  than  30%  in  a  crystal.  Up  to  this  a,  the  criterion  for  the  application  of 
direct  methods  in  x-rays  i.e.  the  normalized  structure  factors  for  actual  and  the  squared 
structures  (E^  and  V^)  should  have  the  same  sign  or  nearly  equal  phases,  is  very  well 
satisfied  in  neutron  diffraction  for  ^  1  as  shown  in  figure  1. 

2.3  Isotopic  differences 

The  neutron  scattering  amplitudes  vary  from  isotope  to  isotope.  This  makes  the 
difference  Patterson  technique*  more  attractive  for  neutrons.  This  map  now  will 


Figure  1.  The  probability  P(E^  that  £*  and  will  be  alike  in  sign  for  different  and  a 
or  x(  =  P/Q)  (from  Sikka  1969). 


*  Difference  Patterson  can  in  principle  be  utilized  with  polarised  neutrons  for  crystals  containing  ordered 
magnetic  atoms,  for  whom  the  scattering  factors  are  neutron  polarisation-dependent.  As  is  well-known,  for 
spin-up  polarisation  of  neutrons,  the  scattering  by  the  magnetic  atom  is  proportional  to  (b  -f  p)  and  for  the 
spin-down  case,  it  is  proportional  to  (b—p).  Here  p  =  magnetic  atom  form  factor.  For  Ni,  p  =  0T3 
X  10”*^  cm  at  sin  0/A  0  compared  to  b  =  102  x  10“^^  cm. 
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contain  only  RR  and  RO  vectors  {R  =  replaceable  atoms,  0  =  other  atoms)  with  peak 
heights  {bf  —b^^)and  [(h^i  —  ^^2)  ^  ^0]  since  and  can  be  of  opposite  sign, 
RO  vectors  can  get  strengthened.  However,  one  has  to  know  the  positions  of  the  R 
atoms  to  recover  the  structure  from  the  difference  Patterson  map.  Again  as  these  will 
not  be  heavy  scatterers  of  neutrons,  unlike  in  x-rays,  these  may  be  difficult  to  locate. 

A  neat  application  of  the  isotopic  replacement  method  is  due  to  Johnson  (1967)  in  3 
endo-phenyl-2-endo  norbornanol  (60  atoms  in  the  asymmetric  unit).  For  this  com¬ 
pound  the  neutron  intensity  data  were  collected  from  two  crystals — one  containing 
hydrogen  atoms  and  the  other  in  which  4  of  the  hydrogen  atoms  had  been  selectively 
substituted  by  deuterium  atoms.  The  replaceable  atoms  were  first  found  by  a  ( |  1 

—  jF^)!)^  Patterson  (similar  to  (A|F|)^  synthesis  of  Rossmann  (1961)  for  anomalous 
dispersion).  A  difference  Patterson  map  (F^  —  F^)  helped  to  locate  additional  eight 
atoms  and  the  remaining  atoms  were  found  by  the  usual  Fourier  synthesis. 

2.4  Complex  scattering  amplitudes 

The  neutron  scattering  amplitudes  for  some  nuclei  like  “^Cd,  ^^^Gd,  ^^^Xe 

etc  which  have  high  resonant  absorption  for  thermal  neutrons  are  complex  and  hence 
the  anomalous  dispersion  method  of  x-rays  becomes  applicable  in  neutron  diffraction 
(Peterson  and  Smith  1962).  However,  there  are  quantitative  differences  in  the  values  of 
real  and  imaginary  dispersion  terms  for  the  two  radiations  (Ramaseshan  1966;  Dale 
and  Willis  1966)  as  shown  below  for  ^^^Cd 


x-rays 

Neutrons)'  '^Cd) 


b'/bo 

~0-3 

74  at2  =  0-55  A 
-74  at  A  -  0-8  A 


b'fbo 

~0-3 

12  4  at  A  =  0-68  A 


The  neutron  values  of  these  ratios  are  an  order  of  magnitude  higher*  than  for  the  x- 
rays  and  can  be  varied  significantly  by  a  variation  in  wavelength  (figure  2).  This  larger 


1 49 


AMPLITUDE  Sm 


*  These  ratios  for  ‘^^Gd  and  *^^Xe  are  even  higher 
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anomalous  dispersion  effect  for  neutrons  can,  therefore,  be  used  to  tackle  more 
complex  structures  than  are  possible  by  x-ray  anomalous  dispersion  method.  The 
power  of  this  technique  is  shown  in  figure  3,  taken  from  Sikka  and  Rajagopal  (1975),  by 
high  average  values  of  the  Bijvoet  ratios: 

<^>=  <A///> 

A/ =  |lF(/i/c/)|2-|F(/jk/)l^|  (5) 

/  =  0-5[|F(;j/c/)|"  +  |F(/iH)|'‘] 

for  hypothetical  structures  containing  N  atoms  and  one  resonant  atom  (natural  Sm  in 
this  case,  containing  13%  resonant  isotope).  Note  that  in  the  x-ray  case, 

structures  have  been  solved  when  <^  >  has  been  as  low  as  OT. 

3.  Applications  and  status  of  direct  and  anomalous  scattering  methods  in  neutron 
diffraction 

3.1  Direct  methods 

The  application  of  direct  methods  (Ij,  tangent  formulas,  and  cosine  and  sine 
invariants  methods  etc)  for  neutron  diffraction  both  for  centrosymmetric  and  non- 
centrosymmetric  structures  is  now  an  accepted  fact.  So  far,  about  25  structures  have 
been  solved  by  this  technique  and  these  have  not  revealed  any  significant  differences 
between  the  application  of  direct  methods  in  x-ray  and  neutrons.  The  errors  in 
phases  (figure  4)  determined  by  this  technique  are  similar  for  the  two  radiations 
( <  1  <^o  ~  <Atan  I  )av  =  28°  for  the  neutron  study  of  Lj-threonine  (Ramanadham  et  al 
1973)  compared  to  the  value  22°  for  x-ray  study  of  alkaloid  Panamine  (Karle  and  Karle 


Figure  3.  Variation  of  <^>  vs  A  for  crystals  containing  one  natural  Sm  atom  and  N  other 
atoms  (from  Sikka  and  Rajagopal  1975). 
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Figure  4.  Frequency  distribution  of  the  error  <  |  </'o  ~  i^tan  D  non-centrosymmetric 
neutron  reflections  in  Lj-threonine  (from  Ramanadham  et  al  1973). 


1966)).  The  largest  structure  solved  using  neutron  data,  by  routine  application  of  the 
program  multan  (Germain  et  al  1971),  up-to-date,  is  that  of  Melampodin  containing 
54  atoms  in  the  asymmetric  unit  (Bernal  and  Watkins  1973).  Out  of  the  22  largest 
maxima  on  the  E  map  of  this  structure,  only  four  were  spurious.  In  view  of  the  above,  it 
may  now  be  said  that  direct  methods  can  be  applied  in  a  neutron  structure  analysis  for 
almost  any  crystal  to  which  they  are  applicable  in  x-rays. 


3.2  Anomalous  dispersion  method 

In  spite  of  the  higher  power  of  this  technique  in  neutron  structure  analysis  for  large 
crystals  like  proteins  (as  discussed  in  §  2.4),  the  method  has  not  lived  up  to  its  early 
promise.  So  far,  the  method  has  been  tested  on  six  small  structures 


(i)  Cd(N03)2  •  4.D2O 

(ii)  Sm(Br03)3  •  9H2O 

(iii)  NaSm  (edta) -81120 

(iv)  Cd(tartrate)  •  5H2O 

(v)  Cd-Histidine  •  2H2O 

(vi)  agua(L-glutamato)Cd(II)  •  H2O 


(MacDonald  and  Sikka  1969), 
(Sikka  1969a), 

(Koetzle  and  Hamilton  1975), 
(Sikka  and  Rajagopal  1975), 
(Bartunik  and  Fuess  1975), 
(Flook  et  al  1977), 


and  one  protein,  “^Cd  myoglobin  (Schoenborn  1975).  The  method  is  beset  with  the 
following  difficulties,  especially  for  complex  crystals. 


3.2a.  Phase  ambiguity  resolution:  It  is  well  known  that  phases  determined  by  the 
anomalous  dispersion  method  at  a  single  wavelength  for  a  crystal  have  two  fold 
ambiguity: 


0  = 

rn-fl  \Fihkl)\^-\FChkW 

4|n||F;| 


(6) 


(7) 
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In  the  x-ray  case,  this  ambiguity  is  resolved  or  circumvented  by  (i)  the  heavy  atom 
method  (Ramachandran  and  Raman  1956),  (ii)  the  double-phase  Fourier  synthesis 
method  (Ramachandran  1964),  (iii)  the  sine-Patterson  method  (Okaya  et  al  1955)  and 
(iv)  the  two  wavelength  method  (Raman  1959;  Ramaseshan  et  al  1957).  For  neutron 
diffraction  as  the  anomalous  site  is  a  light  weight  from  scattering  point  of  view,  method 
(i)  cannot  be  used.  Attempts  by  Schoenborn  to  use  this  method  in  ‘  ^  ^Cd  myoglobin  did 
not  yield  good  results.  Methods  (ii)  and  (iii)  have  successfully  been  applied  in 
Cd(N03)2  4020  (see  figure  5),  Cd(tartrate)  •  5H2O  and  agua(L-glutamato) 
Cd(II)  •  H2O  (see  figure  6).  But  for  large  molecules,  these  methods  may  not  be 
applicable.  This  is  because,  if  hydrogen  atoms  are  present  in  the  structure  (this  will  be 
almost  so  in  every  large  crystal),  then  because  of  their  negative  scattering  length,  they 
will  give  peaks  of  opposite  polarity  in  the  positive  and  negative  images  of  the  structure 
which  are  present  in  Fourier  or  Patterson  maps  of  the  crystal  due  to  these  two  methods 
and  thus  cause  confusion  in  picking  out  the  correct  structure.  The  limitations  of  the  two 
wavelength  method  (successful  in  Sm(Br03)3-9H20  and  Na  Sm(EDTA)'  8H2O)  have 
been  examined  by  Sikka  (1969b).  If  the  phase  ambiguity  has  to  be  resolved  then  the  two 
wavelengths  chosen  for  data  collection  have  to  be  on  the  opposite  side  of  the  resonance. 
For  Cd,  this  is  not  practicable  (Aj  <  0  68  <  X2  and  at  the  incident  neutron  flux  will 
be  low,  scattered  intensity  will  also  be  less  (a/^ )  and  there  will  be  intense  overlapping  of 
reflections)  unless  a  hot  source  coupled  with  the  reactor  is  used.  The  best  anomalous 
scatterer  for  the  two  wavelength  method  seems  to  be  ^^’Gd  (see  figure  7).  It  has 
resonance  at  1-6  A.  However,  no  structure  containing  this  isotope  has  been  studied  so 
far. 

Another  method  suggested  to  circumvent  the  phase  ambiguity  is  to  use  the  tangent 
formula  of  direct  methods  (Sikka  1973).  The  basis  of  this  is  that  in  a  tangent  formula 
calculation,  if  the  input  phases  are  within  45°  of  their  true  values,  the  tangent  formula 
will  determine  and  refine  the  phases  to  a  correct  set.  Now  this  input  set  of  phases  can  be 
obtained  in  two  ways  from  the  anomalous  dispersion  data. 

(i)  One  can  use  the  two-wavelength  method  for  resolving  the  phase  ambiguity  for  a 

limited  set  of  reflections  as  suggested  by  Bartunik  and  Fuess  (1975).  For  Cd 
Histidine  •  2H2O,  these  authors  could  measure  the  phases  utilizing  anomalous 
dispersion,  with  <|0o  ~0anm  I  )av  2'3°. 

(ii)  The  centroid  of  the  two  phases,  (/>!,  and  (/)2  given  by  formula  (6)  can  also  be  used. 
As  pointed  out  by  Sikka  (1973)  and  others,  the  errors  in  phases  by  choosing  the  centroid 
will  be  limited  to  90°  and  there  will  be  reasonable  number  of  reflections  with  errors  less 
than  45°.  Weighted  tangent-formula  calculations  for  L-aspartatocadmium(II)  tri¬ 
hydrate  (P2i2i2i)  as  given  in  table  2,  demonstrate  the  success  of  this  approach. 

3.2b.  Location  of  the  anomalous  scatterer.  In  all  the  above  methods  for  ambiguity 
resolution  of  phases,  prior  knowledge  of  the  position  of  the  anomalous  scatterer  is 
implied.  Again  the  task  of  locating  these  atoms  in  neutrons  is  more  difficult  than  for  x- 
rays  (see  §  2.1).  The  position  of  the  anomalous  scatterer  can,  in  principle,  be  found  by 
the  two  wavelength  methods  (for  which  b^{Xi)  =  h^{X2))*  of  Singh  and  Ramaseshan 
(1968)  and  Sikka  (1969c).  Singh  and  Ramaseshan’s  method  is  an  analytical  method 
which  combines  data  at  these  two-wavelengths  to  give  ,  the  contribution  due  to  the 


*  +  the  total  real  part  of  scattering  from  the  anomalous  atom. 
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Figure  5.  Cd(N03)2  ■  4D2O  (a)  Double  phased  Fourier  synthesis  (b)  Anomalous  disper¬ 
sion  Fourier  synthesis  (from  Macdonald  and  Sikka  1969). 


Figure  6.  Composite  map  of  the  sum  function  of  sine  Patterson  map  of  agua 
(L-glutamato)  Cd(II)  ■  H2O  (from  Flook  et  al  1977). 


anomalous  atoms  alone.  The  position  of  the  anomalous  atoms  can  then  be  determined 
by  a  Patterson  synthesis  with  F  j  as  coefficients.  This  method  has  been  tested  by  Koetzle 
and  Hamilton  (1975)  on  NaSm(EDTA)-  8H2O.  Sikka’s  method  is  based  on  the  cosine 
synthesis  of  Okaya  and  Pepinsky  (1961).  However,  when  the  anomalous  scatterer  is 
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Figure  7.  Optimum  wavelength  regions  for  ‘^^Gd  ‘“’Sm  and  “^Cd  for  two  different 
absorption  factors  (from  Jauch  and  Dachs  1981). 


Table  2.  Tangent  formula  refinement  results  on  L-aspartatocadmium(II)  trihydrate  for 
circumventing  the  phase  ambiguity  in  neutron  anomalous  dispersion  method. 


Set 

number 

Input 

set 

Number  of 
reflections 
in  the  input 

(5 

limit 

Weight  assigned 
in  first  cycle 

<  1  <Po  ‘(’tan  I ) 
for  acentric 
reflections 

Number  of  centric 
reflections  with 
wrong  phase 

1 

4>o 

144 

— 

1 

22.4° 

2 

2 

4>. 

88 

90° 

|cos  ^1 

22.5 

3 

3 

<t>c 

88 

90 

|cOS  ^1^ 

22.5 

1 

4 

4>c 

17 

20 

|cOS  ^1 

24.7 

4 

5  is  the  half  angle  of  the  phase  ambiguity  in  (6)  and  (7).  (j)^  is  the  centroid  phase,  <t)o  is  the  true  value  of  phase 
and  is  the  one  calculated  by  the  tangent  formula.  Out  of  144  reflections,  90  were  acentric. 

<  1 00  —  07A1M !  >  is  the  average  error  after  7  cycles  of  refinement. 


“^Cd,  both  these  methods  require  the  use  of  one  neutron  wavelength  of  about  0-7  A, 
which  as  pointed  above  is  not  very  practicable.  The  method  which  has  shown  more 
promise  is  the  (A|F|)^  synthesis  of  Rossmann  (1961),  used  by  Sikka  and  Rajagopal 
(1975)  in  Cd(tartrate)  •  5H2O  and  by  Schoenborn  (1975)  in  Cd  myoglobin.  For  the 
latter  structure,  the  Marker  section  at  A  =  1-25  A  is  shown  in  figure  8.  The  Patterson 
density  is,  clearly,  much  higher  at  the  Cd-Cd  vector  site. 

Recently,  a  novel  way  to  integrate  the  techniques  of  direct  methods  and  anomalous 
dispersion  technique  for  x-rays  has  been  attempted  by  Hauptman  (1982)  which  avoids 
the  prior  determination  of  the  anomalous  dispersion  sites.  It  will  be  interesting  to 
extend  this  approach  to  neutron  diffraction  also. 


Figure  8.  Harker  section  for  Cd  myoglobin  at  X  =  1-25  A  for  (A|F|)^  synthesis  (from 
Schoenborn  1975).  The  densest  peak  is  the  Cd-Cd  vector  peak. 
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4.  Some  new  techniques 

None  of  the  techniques  described  below  has  been  used  in  an  actual  structure  analysis 
but  the  physics  underlying  them  is  attractive. 

4.1  Resonance  modulated  dijfraction 

This  method  has  been  suggested  by  Hayter  et  al  (1974).  It  is  based  on  the  fact  that  the 
neutron  scattering  lengths  of  some  nuclei  are  spin-dependent  (see  table  1  for  the 
hydrogen  nucleus).  If  these  nuclei  in  crystals  can  be  polarised,  then  the  neutrons, 
which  are  parallel  to  the  direction  of  nuclear  polarisation,  see  a  coherent  scattering 
amplitude  of  and  the  neutrons,  which  are  anti-parallel,  will  be  scattered  with 
amplitude  {b'^  +  21b~)/{21  +  1)  (Schermer  and  Blume  1968).  Now  if  the  intensity  data 
are  recorded  from  unpolarized  and  polarized  crystals,  the  phase  problem  can  be  solved 
by  utilizing  the  difference  Patterson  technique. 

Since,  for  the  hydrogen  nuclei,  b*  and  b~  are  large,  this  technique  according  to 
Hayter  (1976)  opens  up  the  possibility  of  using  these  hydrogens  as  heavy  atoms  for 
phase  determination.  Experiments  so  far  (using  microwaves  for  polarizing  hydrogens 
at  1-5°K)  have  only  demonstrated  the  nuclear  polarization  effects  on  Bragg  intensities 
and  no  actual  structure  determination  has  been  carried  out  yet. 

4.2  Three-beam  method  of  Post  (1977)  for  x-rays 

Post  (1977,  1979)  has  demonstrated  that,  in  principle,  the  phase  problem  in  x-ray 
diffraction  can  be  resolved  if  the  Renninger  effect,  in  which  three  Bragg  planes 
simultaneously  reflect  giving  out  three  diffracted  beams,  is  made  use  of  For  in  this  case, 
according  to  the  dynamical  theory,  the  interactions  among  these  beams  are  coherent 
and  their  intensities  will  contain  information  about  the  phases.  Post,  using  a  single 
crystal  of  AI2O3,  has  experimentally  shown  that  the  intensity  distributions  are  indeed 
dependent  upon  the  products  (//  =  (^i^Ji),  ^  =  (^2^2/2)  and  H-K,  the 

difference),  the  structure  invariants*  as  the  theory  predicts. 

The  above  method  can  be  employed  in  neutron  diffraction  also,  as  the  dynamical 
theory  is  common  to  both  x-rays  and  neutrons  (Shull  1973).  The  four-circle 
diffractometers  in  conjunction  with  position  sensitive  neutron  detectors  or  the  film 
methods  in  neutron  diffraction  (Hohlwein  1975)  can  be  employed  to  measure  these 
intensity  distributions.  However,  it  remains  to  be  seen  even  for  x-rays,  whether  the 
method  will  be  useful  for  imperfect  crystals,  such  as  are  usually  employed  for  crystal 
structure  analysis. 
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*  The  phases  of  these  structure  invariants  (products  of  structure  factors  the  sum  of  whose 
indices  are  zero)  are  invariant  to  the  choice  of  the  origin  and  a  knowledge  of  a  few  of  these  is 
enough  to  initiate  the  use  of  direct  methods  for  the  solution  of  phase  problem. 
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Neutron  diffraction  using  an  electron  linear  accelerator 


B  T  M  WILLIS 

Materials  Physics  Division,  AERE,  Harwell,  Oxon  OX  1 1  ORA,  England 

Abstract.  The  Harwell  electron  linear  accelerator  helios  has  been  used  for  studying  the 
crystal  structures  of  polycrystalline  materials  by  neutron  diffraction.  Atomic  positions  and 
atomic  vibrational  amplitudes  can  be  determined  equally  well  with  the  pulsed  method  as  with 
the  more  conventional  method  employing  a  high-flux  nuclear  reactor  as  source.  The  powder 
pattern  can  be  indexed  more  easily  with  pulsed  neutrons. 

Keywords.  Linear  accelerator;  neutron  diffraction. 


1.  Introduction 

HELIOS,  the  136  MeV  electron  linear  accelerator  at  Harwell,  has  been  used  since  1981  to 
study  the  physics  and  chemistry  of  condensed  matter.  The  accelerator,  whose  main 
features  have  been  described  by  Lynn  (1980),  may  be  viewed  as  a  precursor  of  the  more 
intense  Spallation  Neutron  Source  (sns)  which  is  under  construction  nearby  and  due  to 
operate  in  1985. 

Pulsed  neutron  techniques  are  superior  to  those  employing  the  continuous  neutron 
beam  from  a  nuclear  reactor  for  the  study  of  high-energy  excitations  in  a  solid  or  for  the 
measurement  of  the  structure  factors  of  liquids  or  amorphous  materials  where  the 
observations  must  be  made  over  a  wide  range  of  sin  0//1.  It  is  less  obvious  that  there  are 
also  advantages  in  using  the  pulsed  method  to  examine  polycrystalline  materials  by 
time-of-flight  neutron  diffraction.  We  shall  illustrate  these  advantages  by  describing  the 
results  of  some  experiments  with  helios  and  the  Harwell  back  scattering  spectrometer 
(bss).  The  main  features  of  the  bss  have  been  described  by  Windsor  et  al  (1977).  We  shall 
not  consider  neutron  diffraction  from  single  crystals,  although  it  is  in  this  field  that 
pulsed  neutrons  are  particularly  effective  in  solving  the  crystallographic  phase  problem 
using  anomalous  dispersion  (see  Ramaseshan  and  Narayan  1981). 

2.  Time-of-flight  crystallography 

The  derivation  is  given  in  this  section  of  the  basic  formulae  required  for  interpreting  the 
time-of-flight  diffraction  patterns  from  a  polycrystalline  sample  examined  with  a 
pulsed  neutron  source. 

Pulsed  neutrons  emerge  from  the  target  in  the  ‘condensed  matter  cell’  of  helios  by  a 
two-stage  process.  Bremsstrahlung  radiation  with  a  maximum  photon  energy  equal  to 
the  energy  of  the  electron  beam  (136  MeV)  arises  from  interaction  of  the  electrons  with 
the  heavy-metal  target;  the  gamma-rays  then  take  part  in  a  photonuclear  reaction  to 
produce  fast  neutrons.  These  neutrons  are  subsequently  slowed  down  to  the  thermal,  or 
epithermal,  range  by  a  moderator  consisting  of  a  slab  of  ordinary  water  whose 
moderator  time  matches  the  electron  pulse  length.  The  pulse  width  of  the  moderated 
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Figure  1 .  Geometry  of  back-scattering  spectrometer,  bss  .  M  is  the  neutron  moderator,  S  the 
sample  and  D  the  detector.  D  is  coupled  to  a  time-of-flight  analyser  with  2048  time  channels. 


neutrons  varies  with  their  wavelength,  but  the  mean  time  spread  at  the  moderator  of  the 
neutrons  in  a  single  pulse  is  about  5/isec.  This  time  spread,  together  with  the  distance  L 
=  Lq  +  (see  figure  1)  travelled  by  the  neutrons  from  the  moderator  to  the  detector, 
determine  the  resolution  characteristics  of  the  diffraction  pattern. 

Let  t  be  the  time  taken  by  neutrons  of  velocity  v  to  travel  the  distance  L.  Then 

t  =  Ljv  =  (Lmjh)k,  (1) 

where  we  have  used  the  de  Broglie  equation 

A  =  h/(m„v),  (2) 

relating  the  wavelength  A  and  the  velocity,  (h  is  Planck’s  constant  and  m„  the  neutron 
mass.) 

Putting  the  Bragg  equation 

A  —  2d  sin  ©  (3) 

into  equation  (1),  where  20  is  the  scattering  angle  for  the  planes  of  spacing  d,  gives 

t  —  (505-54  Lsin  0)d,  (4) 

with  t  expressed  in  /isec,  L  in  metres  and  d  in  A  units.  The  path  length  L  in  the  bss  is  14  m 
and  the  scattering  angle  is  170°,  so  that  the  time-of*flight  of  the  neutrons  diffracted  by 
planes  of  1  A  spacing  is  a  little  over  7000  /rsec.  Clearly,  with  an  initial  pulse  length  of 
5  /isec  very  little  uncertainty  in  the  flight  time  is  introduced  by  the  finite  pulse  length. 

Equation  (4)  shows  that  the  time  t  is  directly  proportional  to  the  spacing  d.  For  d 
values  in  the  range  0-5  A  <  d  <  2-5  A,  the  corresponding  times  are  between  3,5(X)  and 
17,500  nsec.  The  neutron  counts  are  recorded  in  2048  time  channels  with  a  separation 
of  a  few  /zsec  between  adjacent  channels,  giving  a  diffraction  pattern  such  as  that  shown 
in  figure  2. 

Differentiating  equation  (3)  gives 

A//2  =  cot  ©  A0,  (5) 

and  so  high  resolution  is  achieved  by  working  at  a  high  scattering  angle  2©  in  the  back 
scattering  region,  where  cot  ©  tends  to  nothing  as  ©  approaches  90°.  If  the  pattern  is 
recorded  for  a  ©  value  in  the  forward  scattering  region,  then  the  lines  in  figure  2  would 
be  less  sharply  defined. 

3.  Diffraction  at  low  sin  ©/A:  measurement  of  d-spacings 

The  first  stage  in  solving  a  crystal  structure  is  the  measurement  of  the  unit-cell 
dimensions.  Using  a  polycrystalline  sample  the  crucial  requirement  is  the  accurate 
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Figure  2.  Time-of-flight  diffraction  pattern  of  silicon  at  low  Q.  The  observed  spectrum  has 
been  ratioed  with  a  vanadium  spectrum:  vanadium  is  an  incoherent  neutron  scatterer,  whose 
spectrum  gives  the  wavelength  dependence  of  the  incident  beam  flux. 


measurement  of  the  first  20-30  lines  of  the  powder  pattern.  This  point  has  been 
emphasised  by  Shirley  (1980):  “There  can  be  few  fields  of  endeavour  in  which  careful 
data  preparation  is  as  dramatically  rewarded  as  in  computer-based  powder  indexing.  A 
properly  measured  pattern  is  usually  solved  almost  immediately,  while  mediocre  data 
can  run  forever”.  In  this  context  a  “properly  measured  pattern”  means  that  the  d- 
spacings  must  be  known  to  a  few  parts  in  10,000. 

Figure  2  shows  a  portion  of  the  low  sin  0/A  pattern  from  silicon,  and  table  1  indicates 
the  kind  of  accuracy  that  can  be  achieved  by  measuring  the  corresponding  interplanar 
spacings.  In  this  table  the  time-of-flight  at  the  maximum  of  each  diffraction  peak: 
the  neutron  counts  are  recorded  in  time  channels  8  /rsec  in  width,  and  can  be 
measured  to  a  few  tenths  of  a  microsecond  by  fitting  a  suitable  analytical  function 
(Albinati  and  Willis  1982)  to  each  peak.  The  observed  d-spacing  in  the  third  column  of 
the  table  was  derived  from  the  relation 

^obs  ^^obs’ 

where  c  is  an  instrumental  constant  given  by  (1)  and  (3)  as 
c  =  {2m„/h)  Lsin  ©. 

in  the  last  column  of  table  1  was  calculated  from 
^caic  =  aoih^  +  k^  +  l^)~^ 

with  Uq,  the  cubic  lattice  parameter  of  silicon  at  20°C,  equal  to  5-43070  A.  The  measured 
spacings  are  within  2  parts  in  10'^  of  the  calculated  values. 
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Table  1.  Interplanar  spacings  of  silicon  by  time  of  flight  crystallography. 


hkl 

^obs 

(/isec) 

^obs 

(A) 

^calc 

(A) 

220 

13503-5 

1-9200 

1-9200 

311 

11515-5 

1-6373 

1-6374 

400 

9549-0 

1-3577 

1-3576 

311 

8760-4 

1-2456 

1-2458 

422 

7796-1 

1-1085 

1-1085 

333 

7351-2 

1-0452 

1-0451 

440 

6753-2 

0-9602 

0-9600 

531 

6455-9 

0-9179 

0-9179 

620 

6038-7 

0-8586 

0-8586 

533 

5823-3 

0-8279 

0-8281 

444 

5512-7 

0-7838 

0-7838 

711 

5348-2 

0-7604 

0-7604 

642 

5103-2 

0-7256 

0-7257 

731 

4972-4 

0-7070 

0-7070 

The  exceptionally  good  resolution  Ad/d  arises  in  time-of-flight  crystallography 
because  the  entire  diffraction  pattern  can  be  collected  in  back  scattering.  To  obtain 
comparable  resoluton  at  large  d  spacings  with  a  reactor  source  it  is  necessary  to  design 
the  diffractometer  to  give  optimum  focussing  conditions  at  low  values  of  sin  ©/A,  but 
this  is  not  normally  done  as  it  will  spoil  the  high-sin  0/A  portion  of  the  pattern  (Hewat 
1975). 

4.  Diffraction  at  high  sin  0/A 

4.1  Measurement  of  atomic  positions 

To  assess  the  accuracy  of  the  time-of-flight  technique  in  determining  atomic  positions,  a 
diffraction  pattern  was  recorded  from  monoclinic  zirconia,  Zr02 ,  which  has  a  distorted 
fluorite-type  structure.  There  are  three  atoms  at  general  positions  (not  fixed  by 
symmetry)  in  the  unit  cell. 

Data  were  collected  up  to  sin  0/A  =  1-5  A  “  ^  with  the  back-scattering  spectrometer, 
using  15  g  of  sample  and  30  hr  of  counting  time.  The  data  were  refined  by  the  Rietveld 
method  (see  Albinati  and  Willis  1982)  in  which  the  entire  profile  of  the  diffraction 
pattern  is  matched  to  the  calculated  profile.  The  final  results  are  shown  in  figure  3  and 


Figure  3.  Portion  of  the  time-of-flight  diffraction  pattern  of  monoclinic  zirconia.  Points  are 
the  raw  data  points,  and  the  continuous  line  the  calculated  best-fit  profile. 
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Table  2.  Fractional  atomic  coordinates  of  monoclinic  zirconia  (space  group  P2,/c) 


Linac  method  Nuclear  reactor  method 


Atom 

X 

V 

2 

X 

y 

2 

Zr 

0-2756(4) 

0-0393(4) 

0-2073(4) 

0-276(1) 

0-040(1) 

0-209(1) 

Ol 

0-0681(5) 

0-3308(4) 

0-3453(4) 

0-072(2) 

0-332(1) 

0-344(1) 

On 

0-4494(4) 

0-7570(5) 

0-4802(4) 

0-450(2) 

0-757(1) 

0-478(1) 

(sin  ©//l)n,ax  =  1-5  A  '  (sin  ©//l)n,ax  =  0-8  A  * 


table  2.  Twenty  five  parameters  were  varied  (including  four  lattice  parameters,  three 
isotropic  temperature  factors  and  three  coefficients  defining  the  background),  but  the 
parameters  representing  the  atomic  positions  only  are  given  in  the  table. 

The  same  sample  of  zirconia  has  also  been  analysed  by  conventional  neutron  powder 
diffraction.  The  high-resoluton  neutron  powder  diffractometer  at  aere  Harwell  was 
used  with  the  pluto  nuclear  reactor  as  source,  and  a  Rietveld  refinement  carried  out 
with  a  computer  program  adapted  to  constant-wavelength,  angle-dispersive  measure¬ 
ments.  These  results  are  listed  in  table  2  alongside  those  from  the  time-of-flight 
technique. 

The  numbers  in  parentheses  in  table  2  are  estimated  standard  deviations  (referring  to 
least  significant  digits).  There  is  satisfactory  agreement  between  the  two  sets  of 
coordinates.  The  greater  precision  for  the  coordinates  derived  by  the  Linac  method  is 
associated  with  the  higher  accessible  value  of  sin  ©/A. 

4.2  Measurement  of  atomic  vibrational  amplitudes 

To  examine  the  accuracy  to  be  expected  from  determing  atomic  vibrational  amplitudes, 
a  polycrystalline  sample  of  thoria  was  chosen.  Th02  crystallises  in  the  cubic  fluorite 
arrangement,  so  that  the  atomic  positions  are  fixed  by  symmetry  and  the  vibrational 
parameters  alone  are  to  be  derived  from  the  analysis. 

Neutron  diffraction  data  were  collected  in  two  separate  time  ranges: 

4,000  to  17,500  fisec  (with  8  psGC  time  channels) 

and 

2,5(X)  to  5,000  /isec  (with  2  psec  time  channels). 

The  maximum  value  of  sin  ©//  was  0-9  in  the  first  case  and  1-5  in  the  second. 
Analysis  was  performed  by  the  Rietveld  method  and  the  results  are  given  in  table  3. 

The  table  shows  that  by  extending  sin  0//,  from  0  9  A~  ^  to  1-5  A  “  ^  the  uncertainty 
in  the  measurement  of  the  atomic  vibration  parameters  is  reduced  by  a  factor  of  two. 
Compared  with  a  reactor-based  instrument,  we  gain  by  using  the  ‘hot’  neutrons  from 
the  epithermal  spectrum  of  helios  .  The  theoretical  values  in  table  2  were  derived  from  a 
force-constant  model  used  in  fitting  phonon  dispersion  relations  to  neutron  inelastic 
scattering  spectra  (Dolling  et  al  1965). 

5.  Conclusions 

We  have  shown  that  pulsed  neutron  diffraction,  using  a  back-scattering  spectrometer 
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Table  3.  Thermal  displacements  of  thorium  dioxide. 


Maximum 


sin  0/2 

Number  of 

Mean  square  displacement 

Mean  square  displacement 

value  of 

reflections 

of  thorium  (A^) 

of  oxygen  (A^) 

(A-') 

in  pattern 

Measured 

Theory 

Measured 

Theory 

0-9 

44 

2-5  ±0-6 

5-8  ±0-8 

2-7 

5-4 

1-5 

130 

2-8  ±0-4 

5-4  ±0-3 

and  the  Harwell  electron  linear  accelerator,  can  lead  to  accurate  measurements  of  d- 
spacings  at  low  values  of  sin  0/>l  and  of  vibrational  parameters  and  atomic  coordinates 
at  high  values  of  sin  @/X.  The  performance  of  the  pulsed  neutron  instrument  matches 
or  exceeds  that  of  the  best  conventional  instrument  on  a  nuclear-reactor  source.  This 
augurs  well  for  the  future  of  the  Spallation  Neutron  Source,  which  is  under  construction 
at  the  Rutherford  Appleton  Laboratory  of  the  uk  .  The  neutron  flux  of  the  sns  will  be 
over  one  hundred  times  that  of  helios. 
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Abstract.  A  general  picture  of  electro-  and  magnetogyration  is  presented  together  with  the 
results  obtained  in  the  study  of  these  phenomena  in  the  crystals  of  linear  dielectrics  and 
ferroelectrics. 


Keywords.  Electrogyration;  magnetogyration;  crystallography;  phase  transition. 


1.  Introduction 

Electrogyration  and  magnetogyration  as  new  physical  phenomena  were  predicted 
rather  recently  on  the  basis  of  general  relations  of  tensor  crystallography  (Zheludev 
1964,  1980,  1982).  After  several  experiments  on  the  observation  and  study  of 
electrogyration  (Vlokh  1970;  Vlokh  et  al  1972,  1975)  these  investigations  were 
rewarded  with  a  diploma  for  a  discovery  (Zheludev  and  Vlokh  1980).  The  first 
communication  on  the  observation  of  magnetogyration  appeared  quite  recently  (Vlokh 
1981,  1982). 

2.  Electrogyration 

Electrogyration,  by  definition,  consists  of  the  effect  of  electric  field  on  the  gyrotropy  of 
crystals.  Analytically,  it  can  be  represented  by  the  power  expansion  of  gyration  tensor 
gij  (second-rank  axial  tensor)  in  electric  fields  E: 

9ij  =  d?j  +  yijkEk  +  PijkiEkEi+  .  .  .  , 

where  gfj  is  the  gyration  tensor  in  the  absence  of  the  field,  and  and  are  third-  and 
fourth-rank  axial  tensors  describing  linear  and  quadratic  electrogyration  effects, 
respectively.  Linear  electrogyration  occurs  in  all  crystal  classes  except  for  m3m,  43m  and 
432;  with  tensor  being  the  same  as  those  describing  piezomagnetism  (Zheludev 
1983). 

Phase  transition  in  ferroelectrics  with  the  appearance  (or  disappearance)  of 
spontaneous  polarization  Pj  provides  a  natural  possibility  to  observe  and  study 
spontaneous  electrogyration.  The  most  simple  and  reliable  are  the  cases  where  a 
centrosymmetric  optically  inactive  crystal  acquires  gyrotropy.  This  phenomenon  was 
observed  for  a  number  of  crystals.  In  particular,  the  studies  of  the  phase  transition  in 
triglycine  sulphate  (tgs)  crystals  near  49°C  show  that  at  temperatures  less*  than  7^ 
specific  rotation  of  the  polarization  plane  p  is  really  proportional  to  P^  (Vlokh  et  al 
1971). 
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For  some  ferroelectrics,  appearance  of  of  opposite  orientations  results  in  the 
formation  of  the  domains  with  enantiomorphism  of  different  signs  having  therefore 
optical  activity  of  different  signs.  This,  particularly,  enables  us  to  observe  both  change 
in  sign  of  optical  activity  and  gyration  hysteresis  in  ferroelectrics  {e.g.  in  5PbO  •  3Ge02 
crystals)  during  domain  repolarization  (Iwasaki  et  al  1972).  Of  special  interest  is  the 
possibility  to  obtain  high-induced  electrogyration  in  ferroelectric  crystals  in  the  phase 
transition  region.  Anomalies  of  induced  electrogyration  in  the  phase  transition  region 
are  determined  finally  by  dielectric  anomalies  (Vlokh  et  al  1977),  the  attainable  specific 
rotation  of  the  light  polarization  plane  being  as  high  as  several  degrees  in  the  field  of 
1  kV. 

Detection  of  forced  electrogyration  required  a  careful  search  for  suitable  crystals  and 
refined  experimental  technique.  Linear  electrogyration  has  been  observed,  measured 
and  studied  in  a-HIOj ,  LiI03 ,  PbMoO^  and  several  other  crystals  todate.  The  study  of 
electrogyration  in  optically  biaxial  a-HIOj  crystals  (class  222)  was  made  difficult  by  the 
accompanying  electrooptic  and  piezooptic  effects.  The  experimental  value  of  electro¬ 
gyration  coefficient  752  is  (4-33  ±0-70)  •  10“^^m/V  for  2  =  632-8  nm  (Vlokh  et  al 
1972). 

Electrogyration  in  uniaxial  crystals  of  classes  4,  6,  3,  etc.  is  not  so  complicated  by 
electrooptic  and  piezoeflfects  if  the  electric  field  applied  and  the  wave  normal  are 
directed  along  the  optical  axis  (the  crystal  remains  uniaxial).  Therefore,  experiments 
with  LilOj  crystals  belonging  to  class  6  (Vlokh  et  al  1975)  were  of  key  importance  in 
proving  the  observability  of  the  forced  electrogyration.  The  effect  was  measured  at 
—  140°C,  since  the  crystal  exhibited  ionic  conductivity  at  room  temperature.  The  value 
of  (1-57 +  0-30)  ■  10“^^  m/V  was  obtained  for  electrogyration  coefficient  733,  the 
contribution  of  piezoelectric  deformation  being  17  %  of  the  value  observed  (figure  1). 

The  most  noteworthy  are  the  experiments  on  forced  electrogyration  in  PbMo04 
crystals,  because,  unlike  the  above  crystals  optically  active  even  in  the  absence  of  the 
electric  field,  PbMo04  acquires  gyrotropy  only  when  the  field  is  applied  (the  crystal  is 
centrosymmetric,  its  symmetry  group  being  4/m).  Electrogyration  can  be  observed  in 
pure  form  without  the  usually  accompanying  electrooptic  and  piezoeffects.  Specific 
rotation  in  readily  obtainable  fields  (~  10  kV/cm)  attains  quite  considerable  values  in 


Figure  1.  Dependence  of  the  increment  of  the  specific  rotation  of  polarization  plane  on  the 
field  strength  for  LilOj  crystals. 
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this  crystal,  being  as  high  as  several  degrees.  The  phenomenon  is  characterized  by  the 
dispersion  (figure  2);  at  £  =  10  kV/cm  p  =  1-5  cm“^  for  X  =  700  nm  and  p  =  5  cm“^ 
for  X  =  400  nm.  Coefficient  ^33  (the  field  and  wave  normal  directed  along  the  optical 
axis)  is  (1-23  ±  0T2)-  10”^^m/V  for  X  —  517  nm  (Vlokh  et  al  1975). 

Quadratic  electrogyration  is  described  by  a  fourth-rank  axial  tensor  and  can  occur 
only  in  acentric  crystals  (Vlokh  et  al  1970).  It  was  experimentally  observed  in  quartz 
crystals  with  electric  field  applied  along  crystallographic  axes  and  plane-polarized  wave 
propagating  along  the  optical  axis.  The  value  of  the  quadratic  electrogyration 
coefficient  observed  was  =  (4-5  ±  0-34)  •10”^'^  (Vlokh  1971). 

3.  Magnetogyration 

There  is  a  number  of  well-known  and  well-investigated  magnetic  phenomena 
(Ramaseshan  1951).  Magnetogyration  is  a  new  one  consisting,  by  definition  (Zheludev 
1964),  of  the  ability  of  anisotropic  media  to  acquire  (or  to  change)  their  optical  activity 
gij  igij  is  a  symmetric  second-rank  tensor)  in  the  magnetic  field, 

9ij  9ijX-  ^ijklikX'  QijklElkEliX-  .  .  .  , 


Figure  2.  Dispersion  of  induced  optical  activity  of  PbMoO^  crystals  in  static  field 
E,  =  10  kV/cm. 


Figure  3.  Block  diagram  of  the  experimental  device  used  in  studying  Faraday  effect  and 
magnetogyration.  1,1' — He-Ne  lasers;  2,2' — A/4  plate  in  diagonal  position;  3,3' — rotatable 
polarizer  with  limb;  4 — sample  under  investigation;  5 — permanent  electromagnet;  6,6' — 
recording  system;  7,7' — galvanometer. 
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where  gfj  is  the  gyration  tensor  in  the  absence  of  the  magnetic  field;  and  and  Qij^t  are 
third-  and  fourth-rank  polar  and  axial  tensors  describing  linear  and  quadratic 
magnetogyration,  respectively.  Tensor  <5,^^  has  nonvanishing  components  only  in  the 
case  of  non-centrosymmetric  crystals  and  textures,  its  form  being  the  same  as  that  of  the 
respective  piezoeffect  tensors.  Magnetogyration  as  well  as  the  natural  optical  activity 
can  be  described  by  a  surface  of  the  “twisted  cylinder”  symmetry,  group  oo  2  (or  by  a  set 
of  such  surfaces). 

If  magnetic  field  is  applied  to  a  non-centrosymmetric  medium  (particularly,  to  a 
nonmagnetic  one),  the  rotation  of  the  light  polarization  plane  is  determined  by  the  two 
effects:  Faraday  effect  and  magnetogyration.  The  contributions  of  these  effects,  linear 
in  field,  can  be  separated  due  to  the  difference  in  their  symmetry  (Faraday  effect 
is  characterized  by  the  symmetry  of  “rotating  cylinder”,  group  oo/m).  One  of  the 
methods  of  their  separation  lies  in  measuring  the  rotation  of  the  polarization  plane  for 
the  light  passed  in  opposite  directions  (figure  3).  If  both  effects  occur  simultaneously, 
they  have  the  same  sign  {i.e.  add  up)  for  one  direction,  and  different  signs  {i.e.  subtract 
from  each  other)  for  the  opposite  direction. 


Figure  4.  Dependence  of  the  specific  rotation  of  polarization  plane  on  the  magnetic  field 
strength  for  CdS  crystals.  1 — wave  vector  is  +  k  2 — wave  vector  is  —k;  2' — curve  2  reflected 
with  respect  to  //^-axis. 
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Magnetogyration  represents  a  magnetic  analogue  of  electrogyration.  In  the  simplest 
cases,  the  former  consists  of  the  appearance  of  optical  activity  induced  by  a  magnetic 
field  (axial  action)  applied  along  the  exclusive  polar  direction  (crystals  of  3m,  4mm,  6mm 
symmetry),  whereas  the  latter  lies  in  the  appearance  of  the  optical  activity  induced  by  an 
electric  field  (polar  action)  applied  along  the  exclusive  axial  directions  (crystals  of  6, 
4/m,  6/m  symmetry).  In  particular,  by  analogy  with  the  occurrence  of  the  spontaneous 
electrogyration  in  the  phase  transitions  due  to  spontaneous  polarization  in  the 
crystals  of  this  symmetry,  spontaneous  magnetization  in  crystals  of  classes  3m,  4mm, 
6mm  results  in  the  spontaneous  magnetogyration.  Magnetogyration  is  a  consequence 
of  the  spatial  dispersion,  i.e.  the  dependence  of  the  quantities  characterizing  the 
interaction  of  electromagnetic  wave  with  medium  on  the  wavelength  (wave  vector) 
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Figure  5.  Dependence  of  the  specific  rotation  of  polarization  plane  on  the  magnetic  field 
strength  for  PbsGesOi  i :  Eu^  ^  crystals  in  different  states  of  spontaneous  electrogyration.  1,2, 
3,  4,  5  — at  20,  50,  120,  140  and  160°C,  respectively;  x — in  polydomain  crystals  (plots  are 
matched  to  the  initial  points  of  monodomain  crystals,  because  in  polydomain  crystals  p  =  0  at 
H  =  0);O,  • — in  monodomain  crystals  at  opposite  directions  of  spontaneous  polarization; 
6 — at  180°C  (paraphase);  7 — for  quartz  crystals  at  opposite  directions  of  the  wave  vector 
(O,  •)  and  from  the  results  of  double  propagation  measurements  (x). 
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Temp.  (°C) 

Figure  6.  Temperature  dependence  of  the  increments  of  the  specific  rotation  of  polariz¬ 
ation  plane  for  PbsGesOu:  crystals  at  =  3  5  kE.  1 — in  polydomain  crystals 

(Faraday  effect);  2,3 — in  monodomain  crystals  at  the  opposite  signs  of  enantiomorphism 
(superposition  of  Faraday  effect  and  magnetogyration).  The  curie  point  of  the  crystal  is  161°C. 


(Agranovich  and  Ginzburg  1979).  Magnetogyration  is  determined  by  the  part  of  this 
dependence  that  appears  due  to  or  is  changed  by  magnetic  field. 

Linear  magnetogyration  was  first  studied  in  the  crystals  of  linear  dielectrics  LilOj 
and  Si02  (Vlokh  1981, 1982).  For  LilOj  crystals  (symmetry  class  6)  the  obtained  values 
of  coefficients  *33  (for  Faraday  effect)  and  <533  (for  magnetogyration)  were  (11 11 5 
+  004)  10”^*  and  (0-51  +0  06)  •  10"“  rad/Oe,  respectively.  Quartz  crystals  were 
studied  mainly  to  ascertain  whether  Faraday  effect  is  not  accompanied  with  magneto¬ 
gyration  if  field  H  is  directed  along  z-axis  (in  third-rank  polar  tensor  <533  =  0  for  the 
crystals  of  class  32).  In  CdS  crystals  magnetogyration  should  appear  in  pure  form  if 
magnetic  field  applied  is  directed  along  the  optical  axis.  In  this  crystal  a33  =  (501-8 
+  0-3)- 10”“  rad/Oe  and  ^33  =  (4-4 +  0.5)- 10"“  rad/Oe  (figure  4). 

In  polydomain  ferroelectrics  with  spontaneous  polarization  in  centrosymmetric 
phase,  induced  linear  magnetogyration  does  not  occur  (tgs,  BaTi03,  etc.).  If  original 
modification  is  optically  active  (e.g.  in  Rochelle  salt),  then  polydomain  crystal  displays 
magnetogyration.  In  the  cases  where  domain  of  opposite  orientation  is  characterized  by 
enantiomorphism  of  different  signs  (transitions  with  the  change  in  symmetry)  e.g.  2/m 
->  2  (tgs),  6-^3  (lead  germanate,  Pb5Ge3  0ii,  etc.),  crystals  monodomainized  by 
electric  fields  of  opposite  directions,  being  affected  by  magnetic  field,  show  magneto¬ 
gyration  of  opposite  signs.  Studies  of  magnetogyration  in  lead  germanate  crystals  gave 
a33  =  (151-5 +  0-2)-  10"“  rad/Oe  and  ^33  =  (2-5 +  0-3)-  10"“  rad/Oe  (figure  5).  In 
the  phase  transition  temperature  region  (near  162°C)  magnetogyration  in  lead 
germanate  demonstrates  an  anomalous  behaviour  and  at  higher  temperatures  vanishes 
(figure  6). 
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A  theoretical  consideration  of  the  intermediate  phase  in  transition 
of  quartz 


N  KATO 

Department  of  Crystalline  Materials  Science,  Faculty  of  Engineering,  Nagoya  University, 
Chikusa-ku,  Nagoya,  Japan. 

Abstract.  Recently,  a  thermodynamically  stable  phase  between  a  (low)  and  p  (high)  phases  of 
quartz  has  teen  found.  This  paper  explains  the  three  phases  by  a  statistical  theory  of  Bragg- 
William’s  type  by  assuming  three  states  of  Si04  tetrahedron  along  a-axis.  Adjusting  the 
interaction  parameters  one  can  predict  the  successive  transitions  of  the  first-order  (T^.)  and  the 
nearly  second-order  (Tq).  Also  with  an  assignment  of  T^-  =  573°C,  one  obtains  Tq  =  T(-+  1-2 
degree  which  fits  to  the  experiment.  The  difference  in  energy  between  Dauphine  twin  bonding 
and  the  normal  bonding  is  estimated  to  be  0-525  eV. 

Keywords.  Phase  transition  theory;  quartz. 

1.  Introduction 

Recently,  we  found  an  intermediate  phase  between  a  (low)  and  fi  (high)  phases  of  quartz 
(Gouhara  et  al  1983).  Figure  1  is  the  most  direct  illustration  of  the  new  phase.  The  figure 
is  an  x-ray  oscillating  topograph*  (Schuttky  1965)  near  the  transition  temperature 
{T(-  =  573°C)  and  under  a  small  thermal  gradient.  The  differences  in  the  intensity  of 
different  phases  are  mainly  due  to  the  difference  in  the  degree  of  extinction.  The  details 
of  the  contrasts  are  given  in  Gouhara  et  al  (1983). 

The  temperature  region  of  the  intermediate  phase  (ip)  is  about  2  degrees  above  7^ . 
Several  evidences  show  that  the  transition  between  a  and  ip  is  of  first-order  and  the 
transition  between  ip  and  ji  is  of  second-order.  Recently,  distinct  satellite  spots 
were  also  observed  near  the  strong  Laue  spots  (Gouhara ct  al  1983).  Analysis  shows  that 
IP  has  a  character  of  incommensurate  phase**. 

This  paper  intends  to  show  that  the  successive  phase  transitions  can  be  understood 
by  statistical  mechanics  assuming  three  energy  states  of  Si04  tetrahedron.  Historically, 
Wannier  (1959)  proposed  a  theory  of  Bragg- William’s  type  assuming  two  energy  states 
but  the  transition  was  predicted  to  be  of  the  second-order.  Recent  experimental 
evidences  favour  the  first-order  transition.  The  present  theory  can  predict  that  the 
transition  between  a  and  ip  is  of  first-order  whereas  the  transition  between  ip  and  ^  can 
be  of  second-order  or  at  least  a  weak  first-order.  Our  experimental  results  agree  with 
this  prediction. 

2.  Free  energy  of  the  system 

Let  us  consider  the  structure  of  quartz  in  figure  2,  which  illustrates  only  Si  atoms  in 
)5-phase.  Si  locates  in  the  middle  of  three  equivalent  a-axes  of  a  unit  cell.  The  crystal  has 


*  This  is  essentially  equivalent  to  the  traverse  pattern  of  Lang’s  technique. 

**  Recently,  a  similar  result  obtained  by  means  of  neutron  diffraction  is  reported  (Dolino  and  Bachheimer 
1983).  Since  no  experiment  curve  is  published  we  cannot  compare  their  results  with  ours. 
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Figure  1.  Three  phases  of  quartz  in  x-ray  diffraction  topograph  taken  near  573°C. 


six-fold  axes.  In  a-phase,  Si  is  displaced  by  ±  u  along  a-axis.  The  crystal  then  transforms 
to  the  structure  of  three-fold  symmetry.  The  phases  corresponding  to  +  are  referred  to 
aj  and  a2,  which  are  mutually  equivalent.  The  macroscopic  coexistence  is  traditionally 
called  Daupine  twin.  Incidentally,  the  separation  by  faint  white  curves  in  the  a  phase  of 
figure  1  can  be  interpreted  to  be  the  twinning  of  this  sort,  which  was  caused  by  thermal 
strain  when  the  temperature  of  the  specimen  was  decreased  from  phase. 

In  the  present  theory,  it  is  assumed  that  Si  can  take  three  positions  on  a-axis;  u  =  0 
and  ±  u.  The  oxygen  tetrahedron  is  associated  to  each  Si.  They  also  take  three  states 
corresponding  to  the  positions  of  Si.  Henceforth,  the  three  states  are  denoted  by  the 
suffixes  0,  1  and  2. 

Since  Si-O  bonding  is  homopolar,  it  is  reasonable  to  take  into  account  only  the 
interaction  between  the  nearest  neighbours  of  tetrahedron  units.  The  interaction 
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Quartz 


«  P3,21 
B  ;  P622 


Tc=573c 


Figure  2.  Static  models  of  ^-quartz  (Si  only). 


energy  is  listed  in  table  1.  Besides,  we  assume  the  intrinsic  energy  of  the  unit,  which  are 
denoted  by  and  =  £2  =  £• 

Assuming  the  mean-field  approximation  (Bragg-William’s  approximation),  one  can 


write  the  total  energy  in  the  form 

U  IN  =  UQ  +  mQ  +  2xnl  +  Ajinin2.  (la) 

In  the  special  case  of  Ui  —  n2,  it  has  the  form 

U /N  =  Uo  +  Ji+ {I- 2]l)nQ  + {2f +  Ji)nQ.  (lb) 

Here,  N  is  the  total  number  of  the  tetrahedron  unit,  and  n,  is  the  population  ratio  in  the 
state  i.  Obviously,  the  condition 

Mo  +  «1  +  «2  =  1  (2) 

must  be  satisfied.  The  (reduced)  energy  parameters  Uq,  I  etc.  are  given  by 

Uo  =  e  +  2t,  (3a) 

£  =  £0 -e  — 4x -f  4/Zo,  (3b) 

T  =  To-bT-2//o,  (3c) 

Ji  =  p  —  x.  (3d) 

Since  a-phase  is  stable  at  a  sufficiently  low  temperature,  U {hq  =  1,  =  n2  =  0)  niust 

be  larger  than  17(no  =  0,  Uj  =  1,  ^2  =  0)-  This  implies  that 

2t  >  0.  (4) 


Table  1.  Interaction  energy  of  the 
nearest  neighbours  of  Si04  tetrahedron. 
Symmetry  is  taken  into  account. 
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Also,  one  can  anticipate  the  right  pair  (i  —  j)  has  less  interaction  energy  than  the  wrong 
pair  (i  ^  j).  Then,  the  following  inequalities  must  be  fulfilled. 

£>0,  T<0,  ^>0  (5a, b, c) 


The  entropy  S  is  the  mixing  entropy  of  Nhq,  Nrii  and  Nn2  in  N  sites.  Therefore,  it  has 
the  expression 

iV!  \ 


S  =  klogi  {l/Nf  ,  , 

\  no!«i'«27 

=  -  kN[nQ  log  Uo  +  «i  log rii  +  Uj  log H2]* 


(6a) 

(6b) 


where  k  is  the  Boltzmann  constant  and  Stirling’s  approximation  is  used  in  the  last 
expression. 

In  the  theormodynamical  equilibrium  the  Helmholtz-free  energy  [f  =  U  —  TS] 
must  be  minimum  with  respect  to  two  independent  variables  of  Mq.  and  n2-  Various 
expressions  are  available  as  the  equilibrium  conditions.  For  example. 


noexp[{(£-4/x)-4(|T|-/x)no}A7’]  =  Uj  exp[-4/mi/fc7’],  (7a) 

=  ^2  exp[  — 4/in2A7’].  (7b) 

Wannier’s  two-state  theory  (Wannier  1959)  (i  =  1,  2)  is  a  special  case  of  £  |>  4ji.  In 
this  case,  the  equilibrium  condition  reduces  to  the  last  relation  (7b).  The  phase 
transition  occurs  at 

=  2jl/k.  (8) 

The  present  theory  is  more  versatile.  Depending  on  the  energy  paramters,  the  various 
types  of  the  phase  transitions  are  described.  Since  this  paper  intends  to  show  the 
possibility  of  the  successive  transitions  (a-ip-)3),  we  shall  not  discuss  the  details  and 
present  only  some  numerical  results  in  the  next  section.  As  to  the  formal  discussion,  it  is 
worth  referring  to  the  paper  of  Nakano  (1964)  which  dealt  with  a  statistical  system  of 
unity  spin. 


3.  Numerical  analysis 

Figure  3  is  a  series  of  the  contour  diagram  of  the  free  energy.  Here,  we  shall  define  a 
normalized  temperature 

R=TIT^.  (9) 

At  low  temperature  (R  =  0-65),  a  large  minimum  A  and  a  small  minimum  51  are 
expected.  Increasing  temperature  {R  =  0-71),  the  minimum  A  becomes  shallow  and  B1 
becomes  deep.  Further,  by  increasing  the  temperature,  A  and  51  disappear  and  another 
minimum  52  appears. 

To  understand  more  precisely  the  behaviour  of  51  and  52,  the  minimum  and 
maximum  positions  of  are  plotted  against  R  under  the  condition  of  nj  =  n2-  One 
example  is  shown  in  figure  4.  The  minimum  below  the  line  of  Mq  =  1/3  is  not  a  real 
minimum  but  a  saddle  point.  A  phase  transition  is  expected  between  Tl  and  T2  with 
hysteresis.  In  this  sense,  the  transition  is  of  first-order.  If  Tl  and  T2  are  very  close,  it  is 
nearly  of  second-order.  This  is  the  case  of  quartz. 
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Figure  4.  Temperature  dependence  of  at  the  minima  (solid  curve)  and  the  maxima 
(broken  curve)  of  free  energy. 


In  figure  3  and  4,  for  the  sake  of  clear  illustration,  the  energy  parameters  used  are  not 
real  ones  (e/^  =  3-925;  |  t  |  //I  =  1  -96).  Nevertheless,  one  can  identify  the  minima  of  A,  Bl 
and  B2  to  the  a,  ip  and  j?  phases,  respectively. 

After  several  trials,  we  finally  obtained  a  set  of  parameters,  which  are  realistic  in  the 
sense  that  the  phase  transitions  occur  successively.  The  parameters  employed  are 

e/Ji  =  3-817,  \x\/Ji=  1-890.  (10a,  b) 
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Then  the  transition  occurs  at 

a^/P:  Pc  =  0-6937,  (11a) 

:  Pg  =  0-69465.  (lib) 

If  Pc  is  assigned  to  the  temperature  Pc  =  573°C,  we  shall  see 

Tq—T(,  =  1-2  degree,  (12) 

which  is  slightly  less  than  the  experimental  value  (Gouhara  et  al  1983).*  By  the 
definitions  of  and  P  (equations  (8)  and  (9)),  we  also  see 

7V  =  946-5°C,  (13) 

and 

^  =  0-525  eV.  (14) 


4.  Discussion 

The  present  theory  is  no  more  than  a  parameter  theory,  although  it  is  microscopic.  The 
parameters  employed,  therefore,  must  be  regarded  merely  as  “effective  ones”. 
Nevertheless,  it  is  worth  discussing  their  physical  implications,  particularly  of  the 
numerical  figures  given  by  (10)  through  (14). 

First,  although  the  intrinsic  energies  (eq  and  e)  were  formally  introduced,  they  must 
be  practically  null.  Secondly,  the  main  part  of  the  interaction  energies  (ie)  listed  in 
table  1  is  the  bonding  energy  of  Si-O-Si.  Thirdly,  the  deformation  of  tetrahedron  units 
is  negligible.  This  is  justified  to  some  extent  by  the  structural  study  of  Young  (1962). 

The  figures  of  ie  were  sought  under  the  requirements  given  by  (4)  and  (5).  From 
these  figures,  with  the  use  of  a  reasonable  working  hypothesis  (cr  =  0),  one  can  obtain 

=  Ji,  fio  =  0-954ji,  To  =  0-018/1,  (15a,b,c) 

The  modification  with  a  finite  t  comparable  with  Tq  does  not  change  the  sequential 
order  of  the  magnitudes  in  (15)  which  is  quite  reasonable.  Incidentally,  /i  is  the  bonding 
energy  per  bond  of  the  interface  between  aj  and  0L2  states. 

Very  little  information  is  available  for  criticizing  the  Ji  values  of  (14).  It  is  reported 
(Klassen-Neklyudova  1964)  that  indentation  with  a  steel  ball  bearing  can  produce 
twinning  even  at  room  temperature.  A  rough  estimation  of  the  required  pressure  P  was 
1000  kg/mm^  or  more.  The  elastic  energy  absorbed  per  unit  cell  divided  by  six  (the 
number  of  bonding)  will  trigger  the  twinning.  Here  again  the  deformation  of  the 
tetrahedron  is  neglected.  Based  on  this  model,  one  can  estimate  /i  by  /1 2(1 /C)P^e 
(C:  elastic  constant,  10“  dyne/cm^;  v:  volume  of  unit  cell),  which  amounts  to 

X  6-24  X  10“^  eV.  Here,  a  is  introduced  as  a  multiplication  factor  to  the  pressure 
mentioned  above.  Since  a  =  3  ~  5  is  not  unrealistic,  the  figure  given  by  (14)  is 
satisfactory  at  the  present  state  of  the  art. 

A  question  may  arise  for  any  theory  of  the  present  type;  namely  whether  the  state  0  is 
really  stable  in  the  potential  curve.  The  present  author  believes  that  the  physical  reality 
is  a  continuous  distribution  of  the  tetrahedron  unit  along  a-  axis  under  a  self-consistent 


*  It  is  not  difficult  to  improve  the  calculation. 
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potential  field.  Thus,  the  problem  of  whether  the  potential  curve  has  double  or  triple 
minima  is  not  significant.  The  double  minimum  theory  of  Wannier  (1959)  cannot 
predict  properly  the  intermediate  phase.  The  success  of  the  present  theory  implies  that 
the  sojourn  of  the  tetrahedron  unit  within  a  limited  range  of  a-coordinate  is  significant. 
The  physical  interpretation  of  the  present  theory,  therefore,  must  be  different  from  that 
for  the  statistical  system  of  unity  spin. 
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Crystal  and  molecular  structure  of  pyrrolidine  dithio  carbamate 
cobalt  (ill)  chelate 


GOPINATH  KARTHA  and  KOTRA  V  KRISHNAMURTHY* 

Roswell  Park  Memorial  Institute,  Buffalo,  New  York  14263,  USA 
♦United  States  Testing  Co.,  Richland,  Washington  99325,  USA 

Abstract.  Pyrrolidine  dithio  carbamate  (pdtc)  cobalt(iii)  chelate  has  been  used  as  a 
coprecipitation  matrix  for  obtaining  preconcentration  factors  of  100  or  more  for  analytical 
estimation  of  trace  metals  in  natural  waters.  This  chelate,  suitable  for  bivalent  and  trivalent 
ions,  crystallizes  as  dark  green  crystals  in  monoclinic  space  group  C2/c  with  unit  cell 
parameters  a  =  28-89  A,  b  =  14-39  A,  c  =  1617  A  and  =  104-2°.  X-ray  crystallographic 
analysis  of  these  crystals  reveals  that  three  pdtc  are  coordinated  to  a  Coq  ^  atom  through  their 
sulphur  atoms,  the  six  sulphur  atoms  forming  a  distorted  octahedra  around  the  metal  atom  at 
an  average  distance  of  2-26  A.  The  thiocarbamate  part  of  the  ligands  is  nearly  planar  but  the 
pyrrolidine  rings  are  puckered. 

Keywords.  Pyrrolidine  dithio  carbamate;  cobalt-chelate;  crystal  structure. 


1.  Introduction 

The  occurrence  of  heavy  metals,  especially  lead  and  cadmium  in  the  environment  is  now 
recognized  as  a  major  health  problem.  The  abundance  of  these  heavy  metals  in  natural 
waters  has  been  directly  linked  to  the  extent  of  their  low  level  toxic  interaction  with 
tissues  such  as  kidney.  With  the  low  ambient  level  of  Cd  ( <  1  ppb)  in  public  waters, 
available  analytical  methods  are  inadequate  and  the  method  of  coprecipitation  for 
preconcentrations  of  trace  metals  prior  to  their  analytical  determination  by  atomic 
absorption  spectroscopy  has  special  advantage  in  reducing  large  volumes  of  the  sam¬ 
ple  to  a  few  milligrams  of  the  material.  Coprecipitation  of  trace  heavy  metals  on  tris- 
pyrrolidine  dithio  carbamato  cobalt  (iii)  has  been  developed  earlier  for  the  study  of  sea 
water  by  Boyle  and  Edmond  (1975).  Krishnamurthy  and  Reddy  (1977)  extended  the 
procedure  to  include  other  natural  waters,  soil  extracts  and  sediment  digestates. 
Concentration  factors  ranging  from  40  to  400  are  obtained  in  this  method  and  it  is 
particularly  useful  for  preconcentration  of  Pb,  Cd,  Ni  and  Cu.  In  this  study  the 
coprecipitation  matrix  has  been  the  subject  of  investigation  in  view  of  its  extraordinary 
thermal  and  photochemical  stability  under  ambient  conditions  as  also  its  versatility  to 
retain  several  trace  metals  such  as  Cu,  Cd,  Pb,  Ni,  Fe  and  Cr  but  not  Mn  and  Zn.  The 
dithio  carbamate  ligand  is  also  of  great  biological  interest  due  to  its  action  on  the 
growth  and  viability  of  sarcoma  cells  and  fibrocytes,  use  in  the  treatment  of  alcoholism 
and  its  function  as  a  protectant  against  lethal  effects  of  radiation  (Powell  1954;  Cohen 
et  al  1957).  The  structural  studies  of  these  ligand  metal  complexes  provides  much 
needed  information  on  the  metal-sulphur  distances  for  characterizing  the  heavy  metal 
sulphur  binding  in  the  tissue.  In  this  paper  we  report  the  results  of  our  study  on  the 
crystal  structure  of  the  matrix  cobalt  (iii)- (pyrrolidine  dithio  carbamate)3. 
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2.  Experimental 

Several  batch  samples  of  the  cobalt-chelate  are  prepared  under  the  coprecipitation 
conditions  used  in  the  analysis  of  natural  waters.  The  procedure  is  as  follows:  to  one 
litre  of  di-ionized  distilled  water  are  added  5  ml  of  6  M  HNO3  and  2  5  ml  of  Co(N03)2 
stock  solution  (~  2000  pg  Co  (ii)  per  10  ml)  followed  by  10  ml  of  NH4OAC  buft'er 
solution  (170g  NH4O-AC  in  5000  ml  of  water  purified  by  ADPC-CCt4  extraction  3 
times).  The  pH  of  the  mixture  should  be  ~  4.  A  2  %  solution  of  the  precipitant, 
ammonium  pyrrolidine  dithio  carbamate  (apdc)  in  the  meanwhile  is  purified  by 
repeated  CCI4  extractions  and  5  ml  of  this  reagent  are  added  to  the  cobalt  mixture.  The 
contents  of  the  beaker  are  allowed  to  stand  for  30  minutes  with  intermittent  stirring 
when  a  yellowish  green  opalescence  develops  due  to  the  formation  of  cobalt  (n) 
pyrrolidine  dithio  carbamato  chelate.  The  yellowish  green  colour  gradually  changes  to 
green  on  standing  due  to  ready  oxidation  of  the  chelate  to  a  +  3  cobalt  state  (Gleu  and 
Schwab  1950;  Malissa  and  Schoffmann  1955).  The  cobalt  (iii)  chelate  thus  obtained  is 
filtered  through  a  0-45  millipore  filter  and  air  dried  at  room  temperature  to  yield  a  green 
flaky  material  exhibiting  creep  properties.  Several  one  liter  batch  precipitations  are 
filtered  through  the  same  filter  paper  to  increase  yield  and  facilitate  sample  handling. 
Microanalytical  results  using  Perkin-Elmer  240  elemental  analyzer  gave  molecular 
formula  CON3S6C15H24  (M.W.  497)  corresponding  to  1 :  3  chelation  between  cobalt 
and  the  ligand. 

The  Co  (pdtc)3  is  dissolved  in  methyl  isobutyl  ketone  and  recrystallized  to  yield  dark 
green  shining  crystals.  The  crystals  are  monoclinic,  C  centered  with  a  —  28-89  A,  b 
=  14-39  A,  c  =  16-17Aand^  =  104-2°.Thex-ray  diffraction  intensities  were  measured 
to  a  Bragg  angle  of  60°  using  nickel  filtered  Cu  radiation  (2  =  1-542  A)  on  a  General 
Electric  490  automatic  diffractometer  by  stationary  crystal  stationary  counter  method 
(Furnas  and  Marker,  1 955).  The  crystals  were  very  small  and  poor  scatterers  and  so  7 1 1 6 
diffraction  data  from  two  equivalent  sets  were  measured  and  averaged  to  give  3506 
unique  reflections.  Only  about  2100  of  these  (60%)  had  intensities  greater  than  3(t(I) 
and  were  considered  observed  and  used  in  structure  refinement.  The  intensities  were 
corrected  for  geometrical  factors  in  the  usual  way.  No  absorption  corrections  were 
applied. 


3.  Structure  determination 

Systematic  absence  hkl  for  +  k  odd  and  hoi  for  /  odd  revealed  C  centering  and  a  glide 
plane  parallel  to  c  axis.  The  intensity  statistics  favoured  a  centrosymmetric  space  group. 
Hence  the  structure  analysis  and  refinements  were  carried  out  in  space  group  C2/c.  The 
measured  density  and  volume  of  the  unit  cell  indicated  12  molecules  of  Co  (pdtc)3  in 
the  unit  cell.  In  space  group  C2/c  this  implies  one  molecule  in  the  general  position  and 
another  in  a  special  position  on  the  two-fold  axis.  The  location  of  the  cobalt  atoms  were 
found  from  a  sharpened  Patterson  map  and  all  the  remaining  37  atoms  were  found  and 
refined  by  Fourier  and  least  squares  methods.  Positional  and  anisotropic  thermal 
parameters  of  all  but  the  three  atoms  on  the  two  fold  axes  were  refined  by  block 
diagonal  least  squares  and  the  usual  R  factors  for  2105  reflections  used  in  the  structure 
determination  and  refinement  is  12%.  Fractional  atomic  coordinates  and  equivalent 
thermal  parameters  are  given  in  table  1.  Lists  of  observed  and  calculated  structure 
factors  are  also  included  as  supplementary  material. 
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Table  1.  Fractional  coordinates  and  equivalent  B  values  of  atoms. 


Atom 

Xja 

Y/b 

Z/c 

^eq 

Atom 

XIa 

Ylb 

Z/c 

^eq 

COA 

0-1673 

0-4609 

0-5303 

5-6 

C5A 

0-2134 

0-7214 

0-3607 

4-5 

COB 

0-0000 

0-0408 

0-7500 

5-4 

CIB 

0-2419 

a3775 

0-6471 

3-9 

SIA 

0-1366 

0-6072 

0-5153 

5-1 

C2B 

0-3080 

0-3839 

0-7745 

4-8 

S2A 

0-2108 

0-5308 

0-4498 

4-4 

C3B 

0-3402 

0-3083 

0-8206 

6-1 

S3A 

0-2117 

0-3299 

0-5519 

4-1 

C4B 

0-3402 

0-2324 

0-7521 

5-3 

S4A 

0-2220 

0-4870 

0-6574 

4-7 

C5B 

0-2959 

0-2415 

0-6860 

4-2 

S5A 

0-1093 

0-4033 

0-5862 

5-3 

CIC 

0-0796 

0-3665 

0-4856 

5-1 

S6A 

0-1144 

0-4004 

0-4158 

3-9 

C2C 

0-0180 

0-2929 

0-3713 

3-4 

SIB 

0-0627 

-0-0586 

0-7785 

4-8 

C3C 

-0-0323 

0-2784 

0-3786 

5-4 

S2B 

0-0153 

0-0258 

0-8918 

5-2 

C4C 

-0-0277 

0-2435 

0-4681 

5-3 

S3B 

0-0487 

0-1651 

0-7510 

5-3 

C5C 

0-0106 

0-2993 

0-5292 

5-3 

NA 

0-1813 

0-7098 

0-4158 

5-5 

CID 

0-0649 

-0-0430 

0-8852 

3-9 

NB 

0-2790 

0-3369 

0-6987 

5-6 

C2D 

0-1430 

-0-1155 

0-9459 

5-6 

NC 

0-0374 

0-3198 

0-4642 

4-9 

C3D 

0-1734 

-0-0942 

1-0322 

5-9 

ND 

0-0953 

-0-0684 

0-9527 

5-1 

C4D 

0-1409 

-0-0954 

1-0951 

6-1 

NE 

0-0000 

0-3216 

0-7500 

6-8 

C5D 

0-0927 

-0-0459 

1-0421 

6-0 

CIA 

0-1772 

0-6288 

0-4549 

4-6 

CIE 

0-0000 

0-2336 

0-7500 

5-4 

C2A 

0-1539 

0-7942 

0-4283 

4-4 

C2E 

-0-0441 

0-3785 

0-7419 

6-2 

C3A 

0-1710 

0-8666 

0-3728 

5-7 

C3E 

0-2250 

0-4799 

0-7292 

6-4 

C4A 

0-1873 

0-8082 

0-3051 

6-0 

4.  Discussion 

The  coordination  geometry  of  the  chelate  complex  and  the  atom  numbering  scheme  are 
shown  in  figure  1.  A  stereoscopic  drawing  of  the  molecule  in  the  general  position  is 
given  in  figure  2.  There  is  another  molecule  in  the  asymmetric  part  of  the  unit  cell  which 


Figure  1 .  The  coordination  of  the  bidentate  ligand  around  the  cobalt  atom  is  shown.  The  six 
sulphur  atoms  form  a  distorted  octahedron  around  the  cobalt  atom  with  an  average  Co-S 
distance  of  2-26  A.  There  is  another  similar  molecule  in  the  asymmetric  unit  sitting  with  the 
cobalt  atom  and  Cj  and  N  atoms  of  one  of  the  ligand  molecules  on  a  crystallographic  two  fold 
axis.  These  molecules  are  also  repeated  by  the  glide  and  inversion  symmetries  of  the  space 
group. 
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Figure  2.  The  stereo  view  of  the  Co  (pdtc)3  molecule.  This  stereo  pair  may  be  conveniently 
viewed  to  see  three  dimensional  arrangement  with  some  practice  or  using  simple  hand  held 
stereo  viewers. 


Figure  3.  Average  dimensions  of  a  pyrrolidine  dithio  carbamate  ligand.  The  carbamate  part 
is  planar  but  the  pyrrolidine  rings  are  puckered. 


is  very  similar  but  sitting  with  the  crystallographic  two  fold  axis  passing  through  cobalt, 
ciE  and  NE  atoms. 

Each  molecule  is  formed  by  three  pdtc  ligands  coordinated  to  the  Co  atoms  through 
their  sulphur  atoms.  The  six  sulphur  atoms  form  a  distorted  octahedron  around  the  Co 
atom  with  an  average  Co-O  distance  of  2-26  A.  The  distortion  from  regular  octahedron 
is  imposed  by  the  molecular  geometry  resulting  in  S1-C0-S2  angle  of  77°  while 
S1-C0-S3  and  S1-C0-S4  are  169  and  96  degrees  respectively. 

All  atoms  in  the  pdtc  ligand  lie  very  nearly  in  a  plane  with  the  cobalt  atom.  The  dithio 
carbamate  part  is  mostly  planar,  but  the  pyrrolidine  part  is  puckered.  The  C  -N  and  S-C 
bonds  are  appreciably  shortened  and  show  partial  double  bond  character.  The  bond 
lengths  and  angles  in  the  five  independent  ligands  are  very  similar  and  figure  3  shows 
the  average  dimensions  for  these.  These  bond  lenghs  and  angles  do  not  show  significant 
deviations  from  other  similar  complexes  (Merlino  1968;  Brennen  and  Bernal  1969; 
Healy  and  White  1972)  involving  both  divalent  and  trivalent  metals.  This  seems  to 
indicate  that  for  dithio  carbamate  ligand,  the  contributions  of  various  resonance 
structures  are  not  much  affected  by  the  nature  of  the  central  atoms  or  the  crystal 
geometry.  These  resonance  stabilized  hybrids  presumably  readily  react  with  radiation 
induced  radicals  giving  the  dithio  carbamate  the  ability  to  protect  the  sensitive 
biological  targets  from  effect  of  radiations. 

The  pyrrolidine  ring  shows  some  interesting  variations  for  the  different  pdtc  ligand 
molecules  in  the  asymmetric  unit.  The  intra  ring  torsion  angles  for  the  five  different 
pyrrolidine  rings  are  given  in  table  2  as  well  as  their  pseudo  rotation  amplitude  and 
phase  angle  (Altona  and  Sundaralingam  1972)  parameters.  It  must  be  remembered  that 
because  of  the  space  group  symmetry  of  the  crystal  there  are  also  an  equal  number  of 
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Table  2.  Torsion  angles  of  pyrrolidine  rings. 


1 

2 

3 

4 

5 

;(0  (C^-N-C“-C'’) 

-1 

-8 

16 

-16 

-16 

Xl  (N-C'-C^-C^) 

25 

22 

-32 

34 

40 

X2  (e-C^-C^-C”*) 

-39 

-30 

43 

-41 

-48 

X3  (C^-C^-C^-N) 

37 

24 

-29 

28 

40 

X4  (C’’-C’-N-C"') 

-23 

-9 

7 

-7 

-16 

Pseudo  rotation  amplitude 

40 

29 

40 

40 

49 

Pseudo  rotation  phase 

268 

254 

66 

244 

252 

rings  for  which  the  torsion  angles  of  each  ring  are  reversed  in  sign  and  hence  have  a 
pseudo  rotation  phase  angle  which  is  180  degrees  in  advance.  Except  for  the  ring  1 
which  has  an  envelope  conformation  with  the  atom  out  of  the  plane  formed  by  the 
other  four  atoms  (Kartha  et  al  1974),  all  other  pyrrolidine  rings  are  significantly  twisted 
and  have  conformations  similar  to  those  observed  in  other  studies  (Druyan  et  al  1976; 
Kartha  et  al  1982)  also. 

The  sulphur  atoms  of  the  different  ligands  around  the  cobalt  atom  show  contacts  of 
3-27  to  3-33  A  and  correspond  to  some  of  the  shortest  S-S  contacts  reported  in  the 
literature. 
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Electron  density  distribution  in  potassium  bis-(carbonato)cuprate(II) 


E  N  MASLEN,  N  SPADACCINI  and  K  J  WATSON 

Crystallography  Centre,  University  of  Western  Australia,  Nedlands,  Western  Australia  6009 

Abstract.  The  electron  density  distribution  in  potassium  his-(carbonato)cuprate(II)  has  been 
analyzed  using  x-ray  diffraction  data  from  an  earlier  structure  determination.  While  the 
copper-ligand  geometry  is  close  to  square  planar  the  deformation  density  near  the  metal  is 
strongly  asymmetric.  There  are  local  maxima  near  the  copper  atom  along  the  line  of  the  Cu-K 
vectors.  These  resemble  features  found  in  corresponding  regions  in  normal  length  metal-metal 
bonds.  The  observation  is  consistent  with  the  long  range  nature  of  the  Coulomb  potential 
associated  with  the  potassium  ion. 

Keywords.  Coulomb  potential;  long  range  force;  electron  density  deformation. 


1.  Introduction 

The  Coulomb  potential  is  slowly  varying  at  long  range.  For  a  random  assembly  of 
charges  with  the  same  sign  long  range  interactions  make  a  dominant  contribution  to  the 
total  energy  because  of  their  number.  Such  terms  are  relatively  unimportant  in 
crystalline  materials  because  macroscopically  matter  is  electrically  neutral. 

Chemical  bonding  refers  to  the  equilibrium  resulting  from  the  static  interaction 
between  neutral  atoms  in  close  proximity.  The  overlapping  of  charge  clouds  alters  the 
balance  between  attractive  and  repulsive  forces  to  give  a  net  attractive  term  which 
dominates  the  energy.  The  overlap  contributions  decrease  rapidly  with  increasing 
internuclear  separation. 

The  bonding  interaction  perturbs  the  atomic  electron  density  distribution  and  the 
consequent  displaced  charge  interacts  via  the  Coulomb  potential  with  other  more 
distant  neighbouring  atoms.  It  is  sometimes  believed  that  the  latter  effect  on  the 
electron  density  should  be  far  smaller  than  the  first;  this  assumption  is  not  consistent 
with  the  different  ranges  over  which  the  contributing  forces  are  effective.  This 
consideration  is  important  for  example  in  the  choice  of  basis  functions  for  crystal 
orbital  calculations  and  for  the  interpretation  of  deformation  densities.  To  de¬ 
monstrate  the  relative  significance  of  the  long  range  of  the  Coulomb  potential  the 
deformation  density  in  a  crystal  structure  in  which  the  long  range  forces  can  be 
easily  visualized  has  been  examined.  The  crystal  structure  of  potassium  his-fcar- 
bonato)cuprate(II),  K2Cu(C03)2,  was  reported  by  Farrand  et  al  (1980).  It  contains 
alkali  metal  atoms  interacting  at  medium  range  with  oxygen  atoms  and  at  longer  range 
with  a  highly  polarizable  transition  metal  atom.  A  preliminary  examination  of  the 
electron  density  distribution  was  made  by  Figgis  et  al  (1981). 

2.  Experimental 

The  data  from  the  original  structural  study  were  used  in  the  present  investigation.  The 
intensities  were  corrected  for  absorption  using  the  program  abscor  and  reduced  to 
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1693  independent  observations.  The  refinement  in  the  earlier  charge  density  study  was 
based  on  /  rather  than  F  to  eliminate  the  bias  caused  by  setting  to  zero  those 
observations  for  which  the  experimentally  determined  intensities  were  negative.  This  is 
not  appropriate  when  the  optimum  values  of  the  structure  factor  moduli  and  their 
associated  errors  are  required  for  the  calculation  of  reliable  difference  densities  (French 
and  Wilson  1978).  To  treat  the  negative  observations  correctly  the  intensities  and  their 
estimated  standard  deviations  were  modified  using  a  Bayesian  statistics  approach 
which  combines  prior  knowledge  of  the  non-negativity  of  the  true  intensities  with  the 
intensity  distribution  determined  from  the  data  set. 

The  positional  and  thermal  parameters  were  redetermined  in  a  weighted,  full  matrix 
least  squares  refinement  using  all  1693  independent  reflections.  The  form  factors  for 
K  Cu,  C  and  O  by  Cromer  and  Mann  (1968)  and  their  respective  real  and  imaginary 
anomalous  dispersion  corrections  (Cromer  and  Liberman  1970)  were  used  and  the  data 
were  weighted  by  l/cr^(F).  The  refinement  converged  giving  the  agreement  factors 
shown  in  table  1.  The  final  structure  parameters  are  listed  in  table  2.  Interatomic 
distances  and  angles  are  given  in  table  3. 

3.  Structure 

The  structure  of  K2Cu(C03)2  has  been  discussed  by  Farrand  et  al  (1980).  It  consists  of 
almost  square  planar  copper-ligand  oxygen  moieties  linked  in  a  polymeric  array  by 
bridging  carbonate  ligands.  The  potassium  cations  are  interspersed  through  the  array. 
Although  each  copper  atom  is  on  a  crystallographic  two-fold  rotation  axis  the 
geometrical  structure  exhibits  local  four-fold  symmetry  about  this  site.  The  angles 
between  the  Cu-O(l)  and  Cu-0(2)  vectors  are  88-56(6)  and  92-00(6)°  and  the  bond 
lengths  are  1-934(1)  and  1-929(1)  A  respectively.  The  copper-ligand  atom  chromophore 
deviates  slightly  from  planarity  in  a  tetrahedral  sense  with  the  copper  atom  lying  0  06  A 
above  and  each  symmetry  related  pair  of  oxygens  lying  014  A  above  and  0-16  A  below 
the  least  squares  plane  through  these  five  atoms. 

The  geometry  of  each  bridging  carbonate  anion  differs  significantly  from  trigonal 


Table  1.  Experimental  and  refinement  data. 


Space  Group 

Fdd2 

Cell  Dimensions 

a 

11-425(3)  A 

b 

17-658(4) 

c 

6-154(2) 

u 

1241-5 

Z 

8 

/r(MoKJ 

47  cm“ ' 

Diffractometer 

Syntex  P2, 

Monochromator 

Graphite 

Maximum  26 

100’ 

Independent  reflections 

1693 

Reflections  used 

1693 

Final  R  factor  (on  F) 

0-035 

Final  weighted  R  factor  (on  F) 

0-026 

Weights  used 

1/<t^(Fo) 

Electron  density  distribution  in  K2Cu(CO^)2 
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Table  2.  Final  atomic  positional  parameters  expressed  in  fractional  coordinates  with 
estimated  standard  deviations  in  parentheses. 


x/a 

y/b 

z/c 

Cu 

0(0) 

0(0) 

0(0) 

K 

0-57808(4) 

0-20005(2) 

0-19615(9) 

0(1) 

008278(11) 

0-09545(7) 

0-01308(29) 

0(2) 

0-10135(11) 

0-19980(7) 

0-21335(35) 

0(3) 

0-05979(13) 

0-08876(9) 

0-37189(31) 

C 

0-08180(14) 

0-12671(9) 

0-20509(39) 

Table  3.  Interatomic  distances  (A)  and  an 
parentheses. 

Bond  distances 

Cu-O(l) 

1-9343(13) 

Cu-0(2)‘ 

1-9290(13) 

C-O(l) 

1-3042(29) 

C-0(2) 

1-3108(20) 

C-0(3) 

1-2514(28) 

Interatomic  distances 

Cu-K" 

3-8649(11) 

Cu-K"‘ 

3-4892(9) 

Cu-K''' 

4-0315(11) 

Bond  angles 

0(l)-Cu-0(2)' 

88-56(6) 

0(l)-Cu-0(2)" 

92-00(6) 

0(l)-C-0(2) 

116-77(20) 

0(l)-C-0(3) 

121-22(16) 

0(2)-C-0(3) 

121-98(22) 

i.  (1/4 -X,  -1/4-i-y, 

- 1/4 -fz)  ii.  (-  1/4 

iii.  (3/4 -X,  -1/4  +  y,  1/4 +  z)  iv.  (3/4 -x,  -1/4  +  y, 


-1/4  +  z) 
-3/4  +  z) 


planar.  The  bridging  bonds,  C-O(l)  and  C-0(2),  are  elongated  relative  to  the  free 
C-0(3)  bond  and  the  0(l)-C-0(2)  angle  is  compressed  from  the  ideal  120°  to  1 16-8(2)°. 

One  pair  of  K  cations,  related  by  the  two-fold  axis,  lies  only  0-27  A  from  the  Cu-ligand 
oxygen  plane,  with  the  Cu-K  vectors  of  length  3-865  A  directed  midway  between  the 
respective  Cu-O(l)  and  Cu-0(2)  bonds.  Four  other  K  cations  surrounding  the  same 
copper  atom  (two  above  at  3-489  A  and  two  below  at  4-032  A)  form  with  the  copper 
atom  a  plane  perpendicular  to  the  Cu-ligand  oxygen  plane.  All  these  distances  between 
Cu  and  K  are  short  enough  that  the  possibility  of  significant  electrostatic  interactions 
between  Cu  and  K  must  be  considered. 

4.  Electron  density 

The  difference  density  section  in  the  Cu-ligand  oxygen  plane  is  shown  in  figure  1.  While 
the  immediate  structural  environment  of  Cu  has  pseudo  four-fold  symmetry  the 
electron  density  distribution  shows  little  trace  of  such  symmetry.  There  are  two  pairs  of 
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Figure  1.  Residual  electron  density  in  the  section  containing  Cu,  0(1)  and  0(2).  Contour 
interval  is  0-2  eA”^ 


local  maxima  within  0-8  A  of  the  copper  atom  lying  between  the  Cu-O(l)  and  Cu-0(2) 
bonds.  Each  member  of  a  pair  is  related  to  the  other  by  the  two-fold  axis  but  the  pairs 
do  not  resemble  each  other  topologically.  These  features  are  not  consistent  with  the 
nearest  neighbour  geometry. 

The  difference  map  features  can  be  accounted  for  in  qualitative  terms  if  the  effects  of 
next-to-nearest  neighbours  are  considered.  In  figure  2  the  difference  density  in  the  same 
plane  extended  to  5  A  from  the  copper  atom  is  shown.  The  broader  pair  of  peaks  near 
the  copper  are  directed  towards  the  approximately  in-plane  K  atoms.  The  electron- 
deficient  regions  near  the  copper  atom  are  rotated  approximately  15°  from  the 
respective  copper-oxygen  bonds. 

Figure  3  shows  the  section  through  the  four  potassium  cations  above  and  below  the 
Cu-ligand  oxygen  plane.  Sharp  maxima  lie  directly  along  the  shortest  Cu-K  vectors, 
with  more  diffuse  regions  of  excess  density  along  the  longest  Cu-K  vectors.  Regions  of 
electron  density  excess  near  the  potassium  atoms  are  directed  approximately  towards 
the  copper  atom. 

Although  the  results  must  be  viewed  with  caution  because  of  the  phase  errors 
associated  with  the  structure  factors  of  this  non-centrosymmetric  structure,  the 
features  in  the  difference  maps  are  consistent  with  a  medium  range  electrostatic 
interaction  between  the  copper  and  alkali  metal  atoms.  The  significance  of  this  effect 
has  been  reinforced  by  the  results  of  a  charge  density  study  of  the  centrosymmetric 
sodium  analogue,  Na2Cu(C03)2  (Maslen  et  at  1983).  Features  in  that  analysis,  which 
are  not  hampered  by  potential  difficulties  of  phasing,  are  similar  to  those  described 
above.  In  Na2Cu(C03)2,  the  effects  of  next-to-nearest  neighbours  on  the  polarization 
of  the  metal  electron  density  is  observed  on  a  number  of  difference  density  sections. 
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Figure  2.  Residual  density  in  the  section  containing  Cu,  0(1)  and  0(2)  extended  to  a 
distance  of  at  least  5  0  A  from  Cu.  Contour  interval  is  0•2eA-^ 


Figure  3.  Residual  density  in  section  perpendicular  to  the  plane  in  figure  1  containing  the 
two-fold  axis  and  the  Cu  and  four  K  atoms.  Contour  interval  is  0-2  eA“^. 
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5.  Conclusions 

It  has  been  shown  that  residual  density  near  transition  metal  atoms  for  near-to-ideal 
structures  can  be  described  in  terms  of  the  effects  of  the  nearest  neighbour  geometry 
(Marumo  et  al  1974).  In  the  present  case,  however,  such  a  procedure  is  inadequate  and 
the  interpretation  must  be  extended  to  correlate  the  density  features  with  next-to- 
nearest  neighbour  interactions.  These  features  resemble  those  observed  in  normal 
length  bonds  between  metal  atoms  (Wang  and  Coppens  1976;  Mitschler  et  al  1978). 
This  suggests  that  the  Cu-K  interaction  might  be  regarded  as  a  weak  metal-metal  bond. 
Ito  and  Higashi  (1983)  have  recently  reported  similar  features  along  the  Al-Al  vectors  in 
LiAlBi4.  The  Al-Al  distance  in  that  structure  is  2-92  A. 

The  assertion  that  in  these  cases  nearest  neighbour  and  next-to-nearest  neighbour 
interactions  lead  to  effects  of  comparable  magnitude  reflects  the  slowly  varying  nature 
of  the  Coulomb  potential.  It  is  therefore  important  always  to  consider  all  interactions  in 
the  interpretation  of  residual  density  maps. 
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Crystallographic  study  of  push-pull  ethylenes 

D  ADHIKESAVALU,  NIRUPA  U  KAMATH  and  K  VENKATESAN 

Department  of  Organic  Chemistry,  Indian  Institute  of  Science,  Bangalore  560012,  India 

Abstract.  From  structural  data  of  several  crystal  structure  analyses  of  push-pull  ethylenes  it  is 
found  that  C=C  bond  lengths  in  this  class  of  molecules  are  significantly  longer  than  the  value 
for  this  bond  in  ethylene.  With  powerful  donors  and  acceptors  such  as  -NMe2  and  -COOMe 
groups  respectively,  the  C=C  bond  length  is  as  long  as  1-460  A.  Correlations  are  observed 
between  (i)  C=C  bond  lengths  and  the  twist  angles  about  the  C=C  bond  and  (ii)  C=C  bond 
lengths  and  the  torsional  barrier  measured  by  the  dnmr  technique. 

Keywords.  X-ray  structure  analysis;  push-pull  ethylenes;  C=C  bond  lengths;  rotational 
barriers. 


1.  Introduction 

Ethylene  substituted  with  electron  donating  and  electron  accepting  groups  in  the 
vicinal  positions  is  known  as  polarized  or  push-pull  ethylenes. 

The  conjugative  interaction  between  the  donor  and  acceptor  parts  of  the  molecule 
through  the  ethylene  n  bond  results  in  interesting  changes  in  chemical  and  physical 
properties  of  this  class  of  compounds.  For  example,  in  1 -dimethyl  amino-2-acetyl 
ethylene  (figure  1)  the  conjugation  gives  rise  to  resonating  structures  (I)  and  (II).  The 
C=C  bond  in  such  systems  are  resistant  to  oxidation  and  reduction  and  to  the  addition 
of  bromine  under  normal  conditions  (Jensen  and  Henricksen  1968).  Whereas  the 
barrier  to  rotation  about  the  C=C  bond  in  1,2-dideuterioethylene  is  62  kcal  mol“  ^  it  is 
substantially  reduced  in  polarized  ethylenes  (Sandstrom  and  Wennerbeck  1971;  Shvo 
and  Shanan-Atidi  1969). 

There  are  two  important  factors  which  determine  the  conformation  about  the 
ethylene  bond  in  this  class  of  molecules:  (i)  intramolecular  steric  interaction  between 
the  substituents  on  the  double  bond,  and  (ii)  loss  of  7r-electron  energy  (£„)  as  the 
rotation  about  the  C=C  bond  increases. 

The  conjugation  is  maximum  in  the  planar  state  and  as  such  this  will  favour  a  planar 
geometry.  The  steric  interaction  between  the  donor  and  acceptor  groups  is  minimum  at 
the  dihedral  angle  of  90°  and  so,  a  90°  twisted  conformation  will  be  favoured. 

Push-pull  ethylenes  can  be  represented  by  the  general  formula  I  as  shown  below 


X— c 

\l 


NMe. 


'H 


II 


Figure  1.  Resonance  structures  of  1 -dimethyl  amino-2-acetyl  ethylene. 
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where  and  D2  represent  electron-donating  groups  and  and  A2  electron  accepting 
groups. 


The  twist  about  the  C=C  double  bond  is  measured  as  the  angle  between  the  plane 
through  the  atoms  D{1),  D{2)  and  C(l)  and  the  plane  through  the  atoms  ^(1),  A{2)  and 
C(2). 

Although  results  based  on  nmr  and  other  techniques  have  been  analysed  in  literature 
(Sandstrom  1983),  no  such  detailed  discussion  of  the  x-ray  results  has  been  reported.  In 
recent  years,  we  have  been  investigating  the  molecular  structures  of  a  large  number  of 
push-pull  ethylenes  via  x-ray  crystallography  and  in  this  paper  we  discuss  our  findings. 

2.  Results  and  discussion 

In  table  1  are  recorded  the  bond  lengths  of  the  ethylene  bond  and  the  twist  angles  for 
the  push-pull  ethylenes  investigated  in  our  laboratory  (Adhikesavalu  and  Venkatesan 
1981, 1982, 1983a,  b;  Nirupa  Kamath  and  Venkatesan  1983)  and  those  by  Abrahamsson 
et  al  (1974).  The  present  results  of  the  push-pull  ethylenes  are  based  on  the  intensity 
data  collected  on  a  Nonius  cad-4  diffractometer  and  the  final  R-factors  are  in  the  range 
4-5-8  %  with  the  exception  of  14  in  table  1  for  which  the  R-value  at  the  present  stage  of 
refinement  is  around  10%.  In  most  of  the  cases  investigated,  the  samples  were  kindly 
provided  by  Professor  J.  Sandstrom.  Also  recorded  in  this  table  are  the  values  of  the 
rotational  barrier  about  the  C=C  bond  measured  using  the  dnmr  technique 
(Sandstrom  1983).  It  may  be  noted  that  in  molecules  1.  to  4  (table  1),  whereas  the  donor 
groups  differ,  the  acceptor  groups  remain  the  same. 

Comparing  \  and  2,  on  the  basis  of  electron  donating  capacity,  the  presence  of  two 
dimethyl  amino  groups  in  j_  would  be  expected  to  lead  to  a  longer  C=C  length  than  in  2 
in  which  the  donor  groups  are  -SMe  and  -NMe  groups.  But  this  is  not  borne  out  by  the 
x-ray  results.  The  structural  evidence  shows  that,  due  to  steric  interaction  the  N-methyl 
groups  are  twisted  by  33°  in  1^  whereas  in  2  it  is  of  lower  magnitude  (23°)  with  respect  to 
the  plane  defined  by  the  two  donor  atoms  and  the  olefinic  carbon  atoms.  The  twist 
about  the  C=C  bond  is  29-3°  in  and  21-7°  in  2.  Comparing  the  molecule  ]_  having  an 
acyclic  donor  with  3  containing  a  cyclic  donor,  we  observe  that  the  C=C  bond  length  is, 
within  experimental  error,  same  although  there  is  a  difference  of  about  9°  in  the  twist 
angle.  Cyclisation  in  3  and  14  has  reduced  the  steric  interaction  between  the  donor  and 
acceptor  parts  of  the  molecule.  Similarly,  the  larger  twist  in  4  (3 1-5°)  compared  to  that  in 
3  (20-2°)  and  14  (22-9°)  is  due  to  the  greater  steric  interaction  in  the  planar  state  in  4.  The 
average  value  of  the  exocyclic  angle  (i.e.  C2-C1-D1  and  C2-C1-D2)  in  4  is  significantly 
smaller  (120-6(5)°)  than  in  3  (124-8(2)°).  However,  the  significantly  longer  C=C  bond 
length  in  4(  1  -429(6)  A)  compared  to  that  in  3  (1  -407(3)  A)  cannot  be  due  to  the  difference 
of  about  1 1  °  in  twist  angles.  In  this  connection  it  is  noteworthy  that  the  difference  in  the 
twist  angles  between  5  and  7  is  about  16°  (table  1)  but  there  is  no  significant  difference 
in  the  C=C  bond  length.  These  observations  seem  to  indicate  that  the  hexahydropyri- 
midine  ring  possesses  better  electron  donating  ability  than  the  five-membered 
imidazolidine  ring. 
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Table  1. 


SI.  No.  & 
Reference 
code 

Structurol  formula 

Rotational 
barrier 
about  the 
C»»C  bond 
(k cols  mole"'' ) 

Twist  about 
the  C=C 
bond  ( ® ) 

C=»C  bond 
length  (A) 

1 

Me2N  /CN 

Me2N  CN 

— 

29.3 

1.407  (4) 

2 

MeS  /CN 

Me2N^  ^CN 

13.2 

21.7 

1.397  (8) 

3 

Me 

' — N^  '^CN 

1 

Me 

— 

20.2 

1.407  (3) 

4 

Me 

/-K 

N^  \CN 

1 

Me 

— 

31.5 

1.429  (6) 

5 

Me2Ns^  /COMc 

Me2N/  ^COOMc 

<8 

56.8 

1.461  (2) 

6 

H 

1 

1 — N-  /COMe 

^C=C 

1 K  ^COMe 

1 

H 

— 

5.0 

1.451  (3) 

7 

8 

Me 

1 

] — N^  /COMe 

1 - N'^  ^COMe 

1 

Me 

Me 

1 

1 - N  /COMe 

L 

' - N  COOMc 

1 

Me 

<8 

<8 

72.9 

62.6 

1.468  (3) 

1-464  (2) 

9 

MeS  /CN 

C  =  C 

Me2N''  '^CPh 

0 

— 

38.3 

1-414  (4) 

10 

Me 

/CN 

C=s=C 

'—N'^  '^CPhBr 

1  1 

Me  O 

9-5 

41.0 

1-448  (4) 

11 

MeSv  /CN 

C4S=C‘"^ 

MeS'^  '^CPhBr 

II 

0 

19.0 

15.0 

1.369  (7) 

12 

MeSv^  yU02 

C  =  C 

MeS'^  ^CN 

19.1 

12-8 

1-377  (3) 

13 

0 

II 

m«2^\ 

^c=.c 

Me2N^  '^Ph 

9.8 

34.8 

1-409  (4) 

u 

Me 

1— /P*' 

c=c 

1 — N-^  \ 

1  CN 

Me 

9  5 

22  9 

1-415  (8) 

E.S.d’s  are  given  in  parenthesis.  References:  X-ray  work  on  1-4,  6-9  (Adhikesavalu  and 
Venkatesan);  5,  12-14  (Nirupa  Kamath  and  Venkatesan).  RotaTionaT  barriers  on  5,  7,  8,  13 
(Wennerbeck  and  Sandstrom  1972);  10, 11  (Abrahamsson  et  al  1974);  12  (Dreier  et  al  1978);  14 
(Sandstrom  et  al  1977);  2  (Sandstrom  and  Wennerbeck  1978). 


162 


D  Adhikesavalu,  Nirupa  U  Kamath  and  K  Venkatesan 


The  barrier  to  rotation  in  ^  could  not  be  measured  due  to  symmetry  reasons  (Kessler 
1968).  Also,  the  barriers  to  rotation  for  3  and  4  have  not  been  reported.  But,  for  similar 
systems  the  barrier  is  reported  to  be  less  than  lOkcals/mol  (Sandstrom  and 
Wennerbeck  1971).  However,  nmr  studies  have  shown  that  the  energy  barrier  about  the 
C=C  bond  is  less  in  molecules  with  electron  donating  groups  present  as  part  of  a  ring 
than  in  those  with  acyclic  systems.  The  barrier  to  rotation  in  2  seems  to  indicate  that  the 
transition  state  is  stabilized  better  in  this  molecule  compared  to  1. 

The  combined  effect  of  the  powerful  electron  donating  -NMe2  group  and  the  strong 
acceptor  combination  of  acetyl  and  carbomethoxy  groups  in  5  to  8  produce  a 
remarkable  lengthening  of  the  C=C  bond  (1-460(3)  A).  The  slightly  smaller  value  of 
C=C  bond  length  in  6  ( 1  -45 1  (3)  A)  compared  to  the  values  obtained  for  5,  7  and  8  may 
be  attributed  to  the  considerably  smaller  angle  of  twist  in  6.  It  is  pertinent  to  observe 
that  the  theoretical  calculations  by  mindo/2  method  (Dewar  and  Haselbach  1970)  have 
shown  that  the  C=C  bond  length  extension  in  ethylene,  on  twisting  about  the  C—C 
bond  by  90°  is  only  0-03  A.  The  C=C  bond  length  calculated  for  90°  twisted  state  is 
1-367  A  compared  to  the  normal  value  of  1-336(2)  A  (Kuchitsu  1966). 

There  would  be  severe  steric  interaction  involving  -NMe2  groups  at  the  donor  end 
and  the  acetyl  groups  at  the  acceptor  end  in  the  planar  state  in  7  and  8.  To  relieve  this 
strain,  the  molecules  are  twisted  72-9°  in  7  and  62-6°  in  8  about  the  C=C  bond.  The 
carbomethoxy  group  introduces  less  steric  strain  compared  to  an  acetyl  group  and  the 
lesser  twist  in  8  may  be  due  to  this.  In  the  absence  of  the  -NMe2  group,  we  should 
expect  the  twist  angle  to  be  less,  but  the  unexpectedly  low  value  of  5°  in  molecule  6  is 
attributed  to  the  two  intramolecular  N-H  .  .  .  O  bonds  (figure  2).  In  fact  the  twist 
angle  corresponding  to  the  minimum  energy  conformation,  calculated  using  semi- 
empirical  methods  (Ramachandran  and  Sasisekharan  1968),  is  22°.  However,  when  the 
contribution  of  the  hydrogen  bonding  energy  is  introduced  in  the  conformational 
energy  calculations,  the  conformation  with  a  twist  angle  of  8°  is  favoured. 

H  NMR  spectra  of  7  under  conditions  of  slow  exchange  shows  two  forms,  one  with 
diastereotopic  and  the  other  with  homotopic  -COMe  protons.  The  first  must  be 


Figure  2.  Intramolecular  N-H  .  . .  O  bonding  in  6. 
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degenerate  EZ-ZE  pair,  whereas  the  second  may  be  EE  or  ZZ  (figure  3).  But  the  EE 
form  is  less  likely  because  of  the  strong  repulsion  to  be  expected  between  parallel 
dipoles  as  shown  by  the  effect  of  solvent  polarity  and  in  particular  by  the  asis 
(Sandstrom  1983). 

In  the  solid  state,  6  adopts  the  ZZ  conformation  favoured  by  the  intramolecular 
hydrogen  bonding.  In  5,  7  and  8  the  acceptor  groups  assume  EZ  conformation. 

The  alkylthio  groups  have  been  found  to  be  poor  donors  and  this  is  clearly  seen  in  the 
shorter  C=C  bond  length  (1-369(7)  A)  in  IT.  Comparison  of  the  C=C  lengths  obtained 
for  9  and  shows  again  that  the  thio  group  is  a  poor  donor.  As  expected  then,  the 
ketene  mercaptals  being  weak  donors,  the  bond  length  in  12  is  1-377(3)  A.  In  both  cases 
the  rotational  barrier  has  been  measured  and  is  of  the  order  of  19  kcalsmoP^ 

The  combination  of  an  acetyl  and  phenyl  group  in  lA  as  acceptor  groups  does  not 
seem  to  produce  the  same  degree  of  delocalization  as  the  acetyl-carbomethoxy 
combination  does  as,  for  example,  observed  in  5  with  the  same  donor  substituents  in 
both  the  molecules. 

3.  Conclusions 

From  the  present  x-ray  crystallographic  results  the  following  significant  conclusions 
may  be  drawn: 

(i)  The  larger  C=C  bond  length  observed  in  molecules  5  to  8  compared  to  those  of  1^  to 
4  clearly  shows  that  the  -COMe,  -COMe  or  -COMe,  -COOMe  substituents  as 
acceptors  are  definitely  better  than  the  -CN  group  as  acceptors.  Further,  as  noted 
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Figure  3.  Conformational  possibilities  of  acetyl  group. 
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earlier,  the  hexahydropyrimidine  ring  appears  to  be  a  better  donor  than  the 
imidazolidine  ring. 

(ii)  In  figure  4,  the  C=C  bond  lengths  are  plotted  against  the  observed  twist  angle  about 
the  C=C  bond.  It  is  observed  that  the  nature  of  the  correlation  for  the  molecules  in 
which  the  donors  are  acyclic  is  different  from  that  for  those  with  cyclic  donors.  For  the 
same  angle  of  rotation,  the  C=C  bond  length  tends  to  be  longer  in  the  molecules  with 
cyclic  donors  than  the  ones  with  acyclic  donors.  However,  for  angles  of  rotation  above 
55°,  the  C=C  bond  lengths  tend  to  be  very  nearly  equal  in  both  the  cases.  The  deviation 
of  molecule  6  from  the  curve  (b)  arises  from  the  intramolecular  N-H  .  .  .  O  hydrogen 
bonding  in  this  molecule.  The  observed  difference  depicted  in  figure  4  with  acyclic  (a) 
and  cyclic  (b)  donors  could  be  partly  due  to  the  difference  in  steric  factors  between  these 
two  classes  of  molecules  although  it  is  known  from  theoretical  calculations  (Dewar  and 
Haselbach  1970)  that  the  variation  expected  in  C=C  bond  length  from  the  twist  about 
this  bond  is  less  than  0  03  A.  Thus  the  importance  of  the  possible  difference  in  the 
electronic  factor  between  the  cyclic  and  acyclic  donors  cannot  be  overruled. 

(iii)  It  is  reasonable  to  expect  a  correlation  between  d  (Cjp2-N)  and  d  (C=C).  However, 
it  is  found  from  the  scatter  plot  of  C=C  and  C(sp^)-N,  that  there  is  a  correlation 
corresponding  to  cyclic  donors  molecules  but  not  when  the  donors  are  acyclic 
(figure  5).  In  the  molecules  with  acyclic  donors  such  as  -NMe2  and  -SMe,  there  is 
rotation  about  N-Me  and  S-Me  bonds  in  order  to  overcome  the  steric  interaction 
between  the  donor  and  acceptor  groups.  However,  this  is  not  possible  to  the  same 
degree  on  the  molecules  with  cyclic  donors.  This  provides  an  explanation  for  the 
absence  of  correlation  in  molecules  with  acyclic  donors. 

(iv)  It  was  pointed  out  earlier  that  the  rotational  barrier  about  the  C=C  bond  in 


Figure  4.  Plot  of  C=C  bond  length  (A)  vs  twist  about  the  C=C  bond  (°).  A  acyclic  donors 
and  o  cyclic  donors. 
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Figure  5.  Correlation  plot  of  rf(Cjp2-N)  vs  d{C=C)  (A). 
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Figure  6.  Plot  of  rotational  barrier  (kcal  mol  ')  vs  C—C  bond  length  (A). 


polarized  ethylenes  is  substantially  reduced.  From  the  x-ray  results  reported  here  it  is 
seen  that  the  C=C  bond  length  can  be  as  large  as  1-460  A.  The  rotational  barrier  value 
may  be  expected  to  decrease  as  the  C=C  bond  length  increases.  The  rotational  barrier 
values  for  quite  a  few  cases  are  not  available  from  the  nmr  data  either  because  the  low 
torsional  barriers  of  C=C  double  bond  are  sometimes  below  the  limit  for  dnmr 
technique  ca  5  kcal  mole"*  for  these  type  of  compounds  or  due  to  experimental 
difficulties. 

In  figure  6  are  plotted  C=C  bond  lengths  vs  the  rotational  barriers.  The  donor  and 
acceptor  groups  being  different  in  these  molecules,  the  intramolecular  steric  strains  will 
be  different.  However,  there  is  a  remarkable  correlation  between  C=C  bond  length  and 
rotational  barrier. 
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(v)  If  we  make  the  reasonable  assumption  that  the  electronic  effects  are  more 
important  in  lengthening  the  C=C  bond,  then  from  the  crystallographic  data  recorded 
in  table  1  for  molecules  3,  6,  7,  8  and  ^  we  arrive  at  the  conclusion  that  the  electron 
withdrawing  power  of  groups  are  in  the  order 

-COMe  w  -COOMe  >  Ph-C=0  >  CN. 

(vi)  It  has  been  shown  from  a  feasibility  study  (Trueblood  and  Dunitz  1983)  that  from 
the  internal  molecular  motions  in  crystals  an  estimation  of  force  constants,  frequencies 
and  barriers  from  diffraction  data  can  be  obtained.  The  rotational  barriers  from  nmr 
measurements  are  available  (Sandstrom  1983)  for  many  of  the  polarized  ethylenes  and 
the  x-ray  data  are  based  on  diffractometric  measurements.  It  was,  therefore,  tempting  to 
use  the  thermal  motion  parameters  for  the  calculation  of  the  rotational  barriers.  From 
the  point  of  view  of  Hirshfeld’s  (1976)  ‘rigid-bond’  test  only  three  cases  (2,  5,  13  in 
table  1)  appeared  reasonable  for  the  estimation  of  rotational  barriers.  But  it  was  very 
disconcerting  to  note  that  the  rotational  barrier  values  obtained  even  for  these  were 
widely  different  from  those  obtained  from  the  nmr  measurements.  Although  crystal 
environment  may  be  responsible  to  some  degree  for  the  discrepancy  it  is  clear  that  far 
more  accurate  intensity  data  than  we  have  now  are  required  to  recover  the  pin  from  the 
haystack. 
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Structural  evidence  for  the  radical  scavenging  mechanism  of  some 
aminothiol  group  of  radioprotectants 

KALYANI  VIJAYAN,  B  M  VEDAVATHI  and  A  MANI 

Materials  Science  Division,  National  Aeronautical  Laboratory,  Bangalore  560017,  India 

Abstract.  Aminothiols  constitute  an  important  group  of  radioprotectants.  The  structures  of  a 
few  well-known  compounds  belonging  to  the  family  of  radioprotectants  have  been  determined 
by  single  crystal  x-ray  diffraction  methods.  The  sulphur  and  the  amino  nitrogen  atoms  are 
separated  by  two  tetrahedral  carbon  atoms  in  these  compounds.  The  gauche  conformation  of 
the  sulphur  and  the  nitrogen  atoms  and  the  consequent  non-bonded  intramolecular  S  .  .  .  N 
interaction  observed  in  some  of  the  crystal  structures  appear  to  favour  the  hypothesis  that  the 
protective  mechanism  of  these  compounds  is  by  free  radical  scavenging. 

Keywords.  Radioprotectants;  aminothiols;  crystal  structure;  radical  scavenging;  S  .  .  .  N 
interaction. 


1.  Introduction 

It  is  well-known  that  on  exposure  to  ionising  radiation,  biological  systems  incur  severe 
damages  affecting  their  properties  and  functions.  Dale  (1942)  showed  that  the 
hazardous  effects  of  radiation  on  biological  systems  may  be  reduced  by  administering 
some  chemicals  prior  to  exposure.  Since  then,  a  large  number  of  such  chemicals  known 
as  ‘radioprotectants’  have  been  identified.  With  the  rapid  advances  in  space  technology, 
chemical  radioprotection  has  gained  importance  and  extensive  research  is  being  carried 
out  to  understand  the  mechanism  of  action  of  chemical  radioprotectants.  Some  of  the 
well-known  theories  of  radioprotection  involve  (i)  the  method  of  radical  scavenging 
(ii)  formation  of  mixed  disulphide  (iii)  production  of  tissue  hypoxia  (iv)  donation  of 
hydrogen  atoms  (v)  binding  or  chelating  to  the  metal  ions  in  the  biological  system  and 
(vi)  introduction  of  energy  traps.  An  essential  pre-requisite  for  understanding  the 
mechanism  of  radioprotection  is  the  study  of  molecular  geometry  of  the  radioprotect¬ 
ants.  In  this  context,  we  have  analysed  the  structures  of  a  few  radioprotectants 
belonging  to  the  aminothiol  family  by  single  crystal  x-ray  diffraction  methods.  The 
results  obtained  are  critically  examined  with  reference  to  a  particular  theory  of 
radioprotection. 

2.  Aminothiol  radioprotectants 

Sulphur  atoms  occurring  in  the  compounds  in  the  aminothiol  family  could  belong  to  a 
-SH  group  or  to  a  disulphide  bond  or  could  be  bonded  to  two  carbon  atoms.  It  is, 
however,  believed  that  the  non-sulphydryl  compounds  are  converted  into  the 
appropriate  sulphydryl  forms  on  administration  and  that  a  free  sulphydryl  group  is 
essential  for  the  radioprotective  action  (Shapira  et  al  1957;  Doherty  et  al  1957a; 
Schwartz  and  Shapiro  1960;  Doherty  and  Shapira  1963).  Most  aminothiol  radioprotect- 
tants  contain  one  or  more  amino  groups,  but  they  could  be  replaced  by  guanidyl  groups 
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without  loss  of  activity.  In  fact,  j5-mercapto  ethyl  guanidine,  a  typical  radioprotectant  is 
known  to  be  more  active  than  ^-mercapto  ethylamine  (Shapira  et  al  1957).  The 
aminothiols  have  been  known  to  possess  a  high  degree  of  structural  specificity  with 
regard  to  their  radioprotective  property.  In  particular,  the  relative  orientation  of  the 
amino  (or  guanidyl)  and  the  thiol  groups  is  believed  to  be  important  in  their  mechanism 
of  action. 

3.  X-ray  results 

The  bromide  hydrobromide  of  S,  2-aminoethyl-isothiouronium  (aet)  (I),  is  an  effective 
radioprotectant  belonging  to  the  aminothiol  group.  A  perspective  view  of  the  aet 
molecule  as  derived  from  the  x-ray  analysis  is  given  in  figure  1  (Vijayan  and  Mani  1977). 
The  terminal  amino  group  and  the  sulphur  atom  in  the  aet  molecule  are  trans  about  the 
connecting  C-C  bond.  This  conformation  eliminates  the  possibility  of  any  intra¬ 
molecular  S  .  .  .  N  interaction. 

When  administered,  aet  is  known  to  undergo  the  chemical  changes  shown  in  figure  2 


N(1) 


Figure  1.  A  perspective  view  of  the  aet  molecule. 
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Figure  2.  Formation  of  2-at  and  ged  from  aet. 
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(Khym  et  al  1957;  Doherty  et  al  1957b).  The  two  end  products,  viz,  2-aminothiazoljne 
(IV)  (2-at)  and  bis  (2-guanidinoethyl)  disulphide  (V)  (ged),  are  obtained  via  the  cyclic 
intermediate  II,  which  could  either  open  to  form  mercaptoethyl  guanidine  (III),  (meg), 
at  pH  7  or  split  off  ammonia  to  form  2-at  at  pH  2-5.  At  pH  7  and  above,  meg  is  unstable 
and  is  readily  oxidized  by  air  to  form  the  disulphide  ged  .  ged  ,  the  crystal  structure  of 
which  has  been  analysed  (Vedavathi  and  Vijayan,  to  be  published)  is  also  known  to 
protect  against  ionising  radiation  (Distefano  et  al  1959;  Schwartz  and  Shapiro  1960). 

A  view  of  the  ged  molecule  as  seen  along  the  S-S  bond  is  given  in  figure  3.  The  two 
halves  of  the  ged  molecule,  though  chemically  identical,  have  strikingly  different 
structural  characteristics.  The  sulphur  and  the  nitrogen  atoms  S(2)  and  N(4)  in  one  half 
of  the  GED  molecule  are  gauche,  leading  to  an  intramolecular  non-bonded  S  .  .  .  N 
distance  of  3-46  A.  However,  in  the  other  half  of  the  molecule,  the  corresponding 
sulphur  and  nitrogen  atoms  S(l)  and  N(l)  are  trans  about  the  connecting  C-C  bond. 

Cystamine  (VI)  is  obtained  by  reduction  of  the  well-known  aminothiol  radioprotect- 


S-CH2-CH2-  NHj 
S-CH2-CH2-NH2 
(YD 


ant  cysteamine  or  /3-mercapto  ethylamine  (mea).  Cystamine  di  HCl,  the  crystal  structure 
of  which  has  been  studied  (Vedavathi  and  Vijayan  1979),  is  known  to  provide  the  same 


Figure  3.  A  view  of  the  GED  molecule  as  seen  along  the  disulphide  bond. 
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N(2) 


Figure  4.  A  view  of  the  cystamine  molecule  as  seen  along  the  disulphide  bond. 


order  of  radioprotection  as  cysteamine  itself  (Doherty  et  al  1957a).  The  cystamine 
molecule  is  essentially  two  ethylamine  groups  connected  by  a  central  disulphide  bond. 
The  only  difference  between  the  chemical  structures  of  cystamine  and  ged  is  the 
replacement  of  amino  groups  in  the  former  by  the  guanidine  groups  in  the  latter.  A 
perspective  view  of  the  cystamine  molecule,  as  seen  along  the  disulphide  bond,  is  given 
in  figure  4.  As  in  the  ged  molecule,  the  chemically  identical  halves  of  the  cystamine  mol¬ 
ecule  have  different  structural  characteristics.  In  one  half,  the  sulphur  and  the  nitrogen 
atoms,  S(l)  and  N(l)  are  gauche  leading  to  an  intramolecular  S  .  .  .  N  distance  of 
3-22  A.  In  the  other  half  of  the  molecule,  the  S(2)  and  the  N(2)  atoms  are  trans, 
eliminating  any  possible  S  .  .  .  N  interaction. 

As  shown  in  figure  2,  one  of  the  end  products  obtained  from  aet  is  2-at.  Even  though 
2-at  is  known  to  be  the  toxic  component,  it  has  also  been  found  to  offer  radioprotection 
(Distefano  et  al  1959;  Shapira  et  al  1957).  The  crystal  structure  of  the  hydrochloride  of 
2-at  has  also  been  determined  (Vedavathi  and  Vijayan  1981).  However,  the  structural 
characteristics  of  2-at  are  not  relevant  to  the  present  discussion. 

4.  Discussion 

4.1  S  ...  N  interaction  in  the  crystal  structures 

The  striking,  common  features  in  the  crystal  structures  of  ged  Br  •  HBr  and  cystamine 
di  HCl  are  the  gauche  conformation  of  the  sulphur  and  the  nitrogen  atoms  and  the 
consequent  non-bonded  S  .  .  .  N  interaction,  observed  in  one  half  of  the  molecules. 
Similar  gauche  conformation  and  a  non-bonded  S  .  .  .  N  interaction  have  also  been 
observed  in  the  crystal  structure  of  another  aminothiol  radioprotectant,  cysteamine 
mono  hydrochloride  (mea  ■  HCl),  (Jandacek  and  Swartz  1970).  The  repeated  occur¬ 
rence  of  the  non-bonded  S  .  .  .  N  interaction  in  the  three  radioprotectants  strongly 
suggests  that  these  features  may  be  related  to  their  mechanism  of  protection. 

4.2  Radical  scavenging  mechanism  and  S  ...  N  interaction 

The  radical  scavenging  mechanism  is  based  on  the  indirect  action  theory  of  radiation.  It 
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is  well-known  that  radiolysis  of  water  produces  the  radicals  H°,  OH°  and  OOH°,  which 
are  free  to  interact  with  the  biological  system  exposed  to  radiation  and  cause  radiation 
damage.  However,  according  to  the  radical  scavenging  theory,  when  a  chemical 
radioprotectant  such  as  an  aminothiol  is  included  in  the  system  exposed  to  radiation, 
the  radioprotectant  preferentially  competes  for  the  radiation  induced  radicals  from 
water,  before  the  free  radicals  enter  into  any  further  action. 

Doherty  and  Shapira  (1957)  have  studied  the  radioprotective  property  of  several 
aminothiols  and  have  put  forth  the  hypothesis  that  both  cysteamine  and  cystamine  are 
protective  by  virtue  of  their  interaction  with  the  radiation  induced  free  radicals  leading 
to  the  formation  of  resonance  stabilised  structures.  According  to  them,  the  radical 
scavenging  action  of  the  aminothiol,  say,  cysteamine,  (mea),  takes  place  as  follows.  The 
radioprotectant  interacts  with  the  free  radicals  HOO°  and  OH°  to  produce  the  radicals 
of  MEA,  as  shown  in  (1). 

MEA  +  H00°^MEA°  +  H202, 

MEA  +  OH°  ^  MEA°  -H  H2O.  ^  ’ 

The  MEA°  radicals  may  enter  into  further  interaction  with  the  free  radicals  from  water 
and  regenerate  into  MEA  itself  (2). 

MEA°  +  HOO°  ^  MEA  +  O2, 

MEA°  +  0H°->MEA  +  0. 

The  resonance  structures  of  the  radicals  described  in  (1)  are  shown  in  figure  5. 

It  is  obvious  that  in  this  mechanism  of  radical  scavenging,  the  stability  of  the 
resonance  hybrids  is  an  important  factor.  This  stability  arises  mainly  from  the 
interaction  between  the  amino  and  the  thiol  groups,  indicated  by  dotted  lines  in 
figure  5.  The  gauche  conformation  and  the  S  .  .  .  N  interaction  observed  in  the  crystal 
structures  of  cystamine,  cysteamine  and  ged  appear  to  provide  evidence  in  support  of 
the  formation  of  resonance  stabilised  structures  and  thus  favour  the  hypothesis  that 
these  compounds  act  by  the  radical  scavenging  mechanism. 

The  stability  of  the  resonance  hybrids  is  also  controlled  by  the  length  of  the  carbon 
chain  between  the  amino  group  and  the  sulphur  atom.  In  cystamine,  cysteamine 
and  GED,  there  are  only  two  carbon  atoms  between  the  sulphur  and  the  nitrogen  atoms 
thus  leading  to  the  formation  of  a  pseudo-five-membered  ring  which  is  known  to  be 
structurally  stable. 

Absence  of  the  intramolecular  S  .  .  .  N  interaction  in  the  crystal  structure  of  aet,  a 
very  effective  aminothiol  group  of  radioprotectant,  is,  however,  consistent  with  the 
results  suggesting  that  aet,  as  such,  is  inactive  and  when  administered,  gets  converted  to 
its  active  form,  mercaptoethyl  guanidine,  meg. 


CH2-CH2  CH2-CH2 


0 


Figure  5.  Resonance  structures  of  the  mea  radical. 
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Structure  and  interactions  of  aspirin>like  drugs 


M  VIJAYAN 

Molecular  Biophysics  Unit,  Indian  Institute  of  Science,  Bangalore  560012,  India 

Abstract.  A  pre-requisite  for  the  elucidation  of  the  mechanism  of  action  of  aspirin-like  drugs, 
which  are  believed  to  exert  their  pharmacological  effects  through  the  inhibition  of 
prostaglandin  biosynthesis,  is  an  understanding  of  their  molecular  geometry,  the  non-covalent 
interactions  they  are  likely  to  be  involved  in,  and  the  geometrical  and  the  electronic 
consequences  of  such  interactions.  This  has  been  sought  to  be  achieved  through  the  x-ray 
analysis  of  these  drug  molecules  and  their  crystalline  complexes  with  other  suitable  molecules. 
The  results  obtained  from  such  studies  have  been  discussed  in  terms  of  specific  typical 
examples.  For  instance,  antipyrine  can  form  metal  and  hydrogen-bonded  complexes; 
phenylbutazone  can  form  ionic  complexes  with  basic  molecules.  Complex  formation  is 
accompanied  by  characteristic  changes  in  the  molecular  geometry  and  the  electronic  structure 
in  both  the  cases.  The  results  obtained  so  far  appear  to  indicate  that  the  important  common 
invariant  structural  features  of  the  fenamates,  deduced  from  crystal  structures,  are  retained 
even  when  complexation  takes  place. 

Keywords.  Aspirin-like  drugs;  prostaglandin  biosynthesis;  non-covalent  interactions;  crystal¬ 
line  complexes;  pyrazole  derivaties;  fenamates. 


1.  Introduction 

Nonsteroid  anti-inflammatory  analgesics  like  salicylates,  aniline  derivatives,  pyrazole 
derivatives  and  fenamates,  which  are  effective  in  varying  degrees  against  pain, 
inflammation  and  fever,  have  been  among  the  most  commonly  used  drugs  for  several 
decades.  Hardly  anything  was  known  about  the  molecular  mechanism  of  their 
pharmacological  action  until  the  early  seventies  when  it  was  discovered  that  all  these 
drugs,  despite  their  widely  different  chemical  structures,  inhibit  prostaglandin  bio¬ 
synthesis  (Vane  1971).  They  are  now  believed  to  act  through  the  inhibition  of  the  system 
of  enzymes  responsible  for  the  biosynthesis  of  prostaglandins  (Flower  1974).  In  this 
context,  a  pre-requisite  for  the  elucidation  of  the  molecular  details  of  the  action  of  these 
compounds  is  a  thorough  understanding  of  (a)  their  molecular  geometry,  (b)  the  non- 
covalent  interactions  they  are  likely  to  be  involved  in  and  (c)  the  changes  introduced  in 
their  molecular  and  electronic  structure  as  a  result  of  these  interactions.  One  of  the 
x-ray  crystallographic  projects  in  our  laboratory  seeks  to  achieve  this  through  the  x-ray 
analysis  of  these  drug  molecules  as  well  as  their  crystalline  complexes  with  other 
suitable  molecules.  The  focus  of  attention  in  these  studies  has  been  on  pyrazole 
derivatives  and  fenamates,  and  the  structures  analysed  include  drugs  such  as  antipyrine 
(Singh  and  Vijayan  1973),  amidopyrine  (Singh  and  Vijayan  1976),  metamizol  (analgin 
or  novalgin)  (Krishna  Murthy  et  ai  1979),  phenylbutazone  (Singh  and  Vijayan  1977), 
oxyphenbutazone  (Krishna  Murthy  and  Vijayan  1981a),  niflumic  acid  (Krishna 
Murthy  and  Vijayan  1979),  meclofenamic  acid  (Krishna  Murthy  and  Vijayan  1981b), 
flufenamic  acid  (Krishna  Murthy  et  al  1982)  and  some  of  their  complexes  (Singh  and 
Vijayan  1974,  1977;  Dhanaraj  and  Vijayan  1983).  These  analyses  have  produced  a 
wealth  of  information  on  the  structure  and  interactions  of  the  concerned  drug 
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molecules,  but  we  shall  confine  ourselves  here  to  a  few  representative  results  to  illustrate 
the  type  of  information  that  could  be  obtained  from  studies  of  this  nature. 

2.  Pyrazole  derivatives 

2.1  Pyrazolone  derivatives 

Pyrazole  derivatives  with  aspirin-like  action  can  be  classified  into  two  groups,  namely, 
pyrazolone  derivatives  and  pyrazolidinedione  derivatives.  Antipyrine  (l-phenyl-2,3- 
dimethyl-5-pyrazolone)  (I)  was  the  first  pyrazolone  derivative  to  be  introduced  into 


^6^5 


(I) 


medical  practice,  and  the  better  known  amidopyrine  and  metamizol  (analgin  or 
novalgin)  are  its  derivatives.  It  is  therefore  perhaps  appropriate  to  take  antipyrine  as  an 
example  of  the  drugs  in  this  family. 

The  molecular  structure  of  antipyrine,  as  derived  from  the  x-ray  analysis  of  its 
crystals,  is  shown  in  figure  1  (Singh  and  Vijayan  1973).  The  five-membered  and  the  six- 
membered  rings,  connected  through  a  N-C  bond,  are  inclined  with  respect  to  each 
other  at  52°.  The  two  nitrogen  atoms  are  pyramidal,  and  the  phenyl  and  the  methyl 
groups  attached  to  them  lie  on  the  opposite  sides  of  the  plane  of  the  five-membered 


0(S) 


Figure  1.  Geometry  of  the  antipyrine  molecule  (Singh  and  Vijayan  1973). 
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(A) 


^6^5 

(B) 


CeHg 

(C) 


Figure  2.  Canonical  structures  of  antipyrine. 


ring.  To  a  good  approximation,  the  molecule  can  be  considered  to  be  a  resonance 
hybrid  of  the  three  canonical  forms  shown  in  figure  2.  The  observed  bond  lengths  in  the 
molecule  can  be  satisfactorily  explained,  using  standard  bond  length-bond  order 
curves,  if  the  contributions  of  A,  B  and  C  are  taken  to  be  66,  22  and  12  %  respectively. 

The  crystal  structures  of  a  number  of  metal  complexes  of  antipyrine  are  available  and 
coordination  is  achieved  in  alt  of  them  through  the  carbonyl  oxygen  atom  (Vijayan  and 
Viswamitra  1966,  1967,  1968;  Cingi  et  al  1972;  Brassy,  et  al  1974a,  b,c,d).  The  bond 
lengths  and  angles  in  the  pyrazolone  moiety  in  the  metal  complexes  have  similar  values 
in  general.  But  they  differ  systematically  from  those  found  in  free  antipyrine.  It  turns 
out  that  the  observed  bond  lengths  in  the  metal  complexes  can  be  accounted  for  if  the 
contributions  from  the  canonical  forms  A,  B  and  C  (figure  2)  are  taken  to  be  41,  37  and 
22  %  respectively.  Thus,  while  the  neutral  form  predominates  in  free  antipyrine,  the 
polar  forms  do  in  the  metal  complexes.  Another  difference  between  the  molecular 
geometries  of  antipyrine  in  the  uncomplexed  state  and  in  the  metal  complexes  pertains 
to  the  hybridisation  state  of  the  hetero  nitrogen  atoms.  The  two  nitrogen  atoms  are 
more  pyramidal  in  the  free  molecule  than  in  the  metal  complexes.  Consequently,  the 
geometry  of  the  molecule  is  substantially  different  in  the  two  cases.  This  is  illustrated  in 
figure  3  which  shows  the  uncomplexed  molecule  as  well  as  that  in  a  metal  complex, 
viewed  along  comparable  directions.  This  represents  an  interesting  situation  in  which  a 
perturbation  at  one  end  of  the  molecule,  namely,  association  of  a  metal  ion  at  the 
carbonyl  oxygen  atom,  induces  striking  changes  in  its  over  all  geometry. 

In  addition  to  metal  complexes,  antipyrine  is  known  to  form  hydrogen-bonded 
complexes  as  well.  The  1 :  1  complex  between  antipyrine  and  salicylic  acid  (generally 


C(7) 


Figure  3.  Two  views  of  the  molecule  in  free  antipyrine  and  in  its  metal  complexes  along 
comparable  directions  (Singh  and  Vijayan  1973).  Solid  and  broken  lines  correspond  to  free 
antipyrine  and  calcium  hexa-antipyrine  perchlorate  respectively. 
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known  as  salipyrine)  is  typical  of  the  latter.  The  structure  of  this  complex,  determined 
by  x-ray  crystal  structure  analysis,  is  shown  in  figure  4  (Singh  and  Vijayan  1974). 
Complexation  is  achieved  through  an  intermolecular  O-H  .  .  .  O  hydrogen  bond  with 
a  carboxyl  oxygen  of  the  salicylic  acid  molecule  as  the  donor  and  the  carbonyl  oxygen  of 
the  antipyrine  molecule  as  the  acceptor.  In  the  antipyrine  molecule  in  the  complex,  the 
pyramidal  nature  of  the  hetero  nitrogen  atoms  is  intermediate  between  those  observed 
in  free  antipyrine  and  in  the  metal  complexes.  The  bond  lengths  in  the  pyrazolone 
moiety  in  the  complex  indicated  the  contributions  of  forms  A,  B  and  C  to  be  45,  34  and 
21  %  respectively,  suggesting  thereby  that  the  antipyrine  molecule  in  salipyrine  is  more 
polar  than  in  free  antipyrine  but  less  so  compared  to  the  molecule  in  the  metal 
complexes. 

It  has  been  possible  to  define  the  molecular  geometry  of  antipyrine  in  different  states 
of  association,  from  the  studies  outlined  above.  The  molecule  can  take  part  in  metal 
coordination  or  hydrogen  bonding.  Association  is  achieved  by  an  interaction  at  the 
carbonyl  group  in  both  the  cases  and  its  effect  is  (i)  to  make  the  hetero  nitrogen  atoms 
more  planar  and  consequently  to  introduce  substantial  changes  in  molecular  geometry; 
and  (ii)  to  increase  the  polar  nature  of  the  molecule.  The  interaction  of  a  metal  ion  with 
the  carbonyl  group  is  expected  to  be  stronger  than  that  arising  from  a  hydrogen  bond 
and  hence  the  planarity  of  the  nitrogen  atoms  and  the  polar  nature  of  the  molecule  are 
more  pronounced  in  metal  complexes  than  in  salipyrine. 

2.2  Pyrazolidinedione  derivatives 

Phenylbutazone  (4-butyl-l,2-diphenyl-3,5-pyrazolidine-dione)  (II)  and  its  hydroxy 
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derivative  oxyphenbutazone  are  the  best  known  pyrazolidinediones  with  aspirin-like 
activity.  The  three-dimensional  structures  of  the  two  crystallographically-independent 
molecules  in  the  crystals  of  phenylbutazone  are  shown  in  figure  5  (Singh  and  Vijayan 
1977).  The  two  molecules  have  essentially  the  same  structure  except  for  the  differences 
in  the  conformation  of  the  butyl  group.  The  two  nitrogen  atoms  in  the  five-membered 
ring  are  pyramidal  and  the  attached  phenyl  rings  lie  on  opposite  sides  of  the  plane  of  the 
pyrazole  ring.  But  for  the  presence  of  the  butyl  group,  the  molecule  could  have  had  two¬ 
fold  symmetry  about  an  axis  passing  through  the  tetrahedral  carbon  at  the  4-position 
and  the  mid-point  of  the  N-N  bond.  This  symmetry  is,  however,  prevented  by  the 


Figure  5.  Geometries  of  the  two  crystallographically  independent  molecules  in  the  structure 
of  phenylbutazone  (Singh  and  Vijayan  1977). 


substitution  of  the  butyl  group  at  the  tetrahedral  carbon  atom.  The  environments  of  the 
two  phenyl  groups  are  also  now  dissimilar  and  their  orientations  with  respect  to  the 
five-membered  ring  are  different. 

The  phenylbutazone  molecule,  being  acidic,  may  be  expected  to  interact  with  basic 
amino  acid  residues.  Attempts  were  made  to  co-crystallize  it  with  basic  amino  acids  and 
their  derivatives,  but  these  attempts  were  not  successful.  It  was,  however,  possible  to 
prepare  a  2 ;  1  crystalline  complex  of  phenylbutazone  with  the  basic  compound, 
piperazine.  The  ring  carbon  atom  C(4)  in  phenylbutazone  is  deprotonated  when 
complexation  takes  place.  The  structure  of  the  resulting  anionic  molecule  is  shown  in 
figure  6  (Singh  and  Vijayan  1977).  The  complex  is  stabilized  by  ionic  and  hydrogen 
bonded  interactions  with  the  cationic  piperazine  molecules.  There  are  significant 
structural  differences  between  the  neutral  phenylbutazone  molecule  and  the  de¬ 
protonated  anionic  molecule  in  the  complex.  The  carbon  atom  C(4)  is  tetrahedral  in  the 
former  whereas  it  is  trigonal  in  the  latter  resulting  in  different  orientations  of  the  butyl 
group  in  the  two  cases.  The  excess  electron  in  the  anionic  molecule  is  delocalised  over 
the  two  C-O  and  the  two  C-C  bonds  whereas  these  bonds  are  double  and  single 
respectively  in  the  neutral  molecule,  as  can  be  seen  from  the  bond  lengths  given  in 
figure  7.  As  C(4)  is  sp^  in  the  anion,  the  C(4)-C(7)  bond  lies  in  the  plane  of  the  five- 
membered  ring  and  the  butyl  group  has  no  preferred  orientation  with  respect  to  this 
plane;  therefore,  unlike  in  the  case  of  free  phenylbutazone,  the  two  phenyl  rings  are 
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Figure  6.  Molecular  geometry  of  the  anionic  phenylbutazone  in  Its  complex  with  piperazine 
(Singh  and  Vijayan  1977). 
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Figure  7.  Bond  lengths  of  the  pyrazolidinedione  moiety  in  the  crystals  of  phenylbutazone 
and  those  of  the  phenylbutazone-piperazine  complex. 


Structurally  indistinguishable  and  are  oriented  with  respect  to  the  five-membered  ring 
at  nearly  equal  angles.  Therefore,  the  two-fold  symmetry  of  the  molecule  is  almost 
restored  on  account  of  the  deprotonation  of  C(4). 

The  results  outlined  above,  which  have  been  corroborated  by  those  of  the  x-ray 
analysis  of  1 ;  1  complex  between  phenylbutazone  and  N-methyl  piperazine  (Toussaint 
et  al  1974);  indicate  that  C(4)  is  deprotonated,  with  accompanying  changes  in  the 
molecule,  when  phenylbutazone  interacts  with  a  basic  molecule  or  chemical  group. 
Obviously  then,  deprotonation  of  the  phenylbutazone  molecule  is  likely  to  occur  when 
it  approaches  carrier  proteins  or  the  prostaglandin  synthatase  system  as  a  result  of  its 
interaction  with  basic  amino  acid  residues.  The  consequent  characteristic  changes  in  the 
geometry  and  the  electronic  structure  of  the  molecule  is  likely  to  be  of  importance  in  its 
protein  binding  properties  and  its  inhibitory  action  against  prostaglandin  synthatase. 
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2.3  Fenamates 

Fenamates  (figure  8)  form  an  important  recently  developed  group  of  non-steroidal 
analgesics.  The  molecules  of  this  family  of  compounds  essentially  consist  of  three 
planar  groupings,  namely,  two  six-membered  rings  and  a  carboxyl  group.  The 
important  members  of  this  family  are  mefenamic  acid,  flufenamic  acid,  meclofenamic 
acid  and  niflumic  acid.  The  crystal  structures  of  all  these  compounds  have  been 
determined  (Krishna  Murthy  and  Vijayan  1979,  1981a;  Krishna  Murthy  et  al  1982; 
McConnell  1973;  McConnell  and  Company  1976).  A  striking  common  feature  of  all 
these  structures  is  the  coplanarity  of  the  six-membered  ring  carrying  the  carboxyl 
group,  the  carboxyl  group  and  the  imino  nitrogen  atom.  The  internal  N-H  .  .  .  O 
hydrogen  bond  with  the  imino  nitrogen  as  the  donor  and  one  of  the  carboxyl  oxygens 
as  the  acceptor,  also  exists  in  all  the  structures.  This  hydrogen  bond,  along  with 
resonance  interactions  between  the  six-membered  ring  and  the  carboxyl  group, 
presumably  stabilise  the  coplanar  arrangement.  The  variations  in  the  geometry  of  the 
molecules  arise  almost  exclusively  on  account  of  the  differences  in  the  orientation  of  the 
second  six-membered  ring  with  respect  to  the  rigid  coplanar  moiety. 

The  crystal  structure  of  a  1:1  complex  between  one  of  the  fenamates,  namely, 
niflumic  acid,  and  ethanolamine  has  been  determined  recently  (Dhanaraj  and  Vijayan 
1983).  As  can  be  seen  from  the  perspective  view  of  the  complex,  shown  in  figure  9,  the 
interactions  between  the  two  molecules  involve  primarily,  though  not  exclusively,  the 
deprotonated  negatively  charged  carboxylate  group  of  niflumic  acid  and  the  proto- 
nated  positively  charged  amino  group  of  ethanolamine.  It  is  interesting  to  note  that  the 
invariant  structural  features  of  the  fenamates  observed  in  their  crystal  structures  are 
retained  in  the  complex  also. 

The  work  on  fenamates,  particularly  that  involving  complexes,  is  still  in  progress. 
From  the  results  obtained  so  far,  it  would  appear  that  the  only  invariant  features  of  the 
fenamates  is  the  rigid  coplanar  geometry  of  the  six-membered  ring  carrying  the 
carboxyl  group,  the  carboxyl  group  and  the  imino  nitrogen  atom,  and  also  the  internal 

R2 


Figure  8.  General  formula  of  fenamates. 
Mefenamic  acid:  R=H;  R1=R2=CH3;  X=CH 
Flufenamic  acid:  R=R1=H;  R2=CF3;  X=CH 
Niflumic  acid:  R=R1=H;  R2=CF3;  X=N 
Meclofenamic  acid:  R=R1  =C1;  R2=CH3;  X=CH 
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Figure  9.  Molecular  structure  of  the  1 ;  1  complex  between  niflumic  acid  and  ethanolamine 
(Dhanaraj  and  Vijayan  1983). 


hydrogen  bond  connecting  the  imino  and  the  carboxyl  groups.  The  structure  of  the 
crystalline  complex  of  niflumic  acid  with  ethanolamine  indicates  that  they  are  perhaps 
retained  even  when  the  fenamates  interact  with  other  molecules.  It  appears  to  be 
reasonable  to  assume  that  these  invariant  features  are  a  requirement  for  their 
pharmacological  action. 

4.  Conclusion 

The  approach  involving  crystalline  complexes,  which  has  been  recognised  to  be  of 
considerable  utility  in  elucidating  biologically  and  evolutionarily  significant  non- 
covalent  interactions  involving  amino  acids  and  peptides  (Vijayan  1980;  Sudhakar  et  al 
1980;  Salunke  and  Vijayan  1981;  Vijayan  1983;  Suresh  and  Vijayan  1983a,  b)  has  been 
found  to  be  useful  in  studying  the  structure  and  interactions  of  drug  molecules  as  well. 
Using  this  approach,  it  has  been  possible  to  elucidate  the  molecular  geometry  of  several 
pyrazole  derivatives  and  fenamates  with  aspirin-like  activity,  in  the  free  state  as  well  as 
in  states  of  association  with  other  molecules.  This  approach  has  also  provided  valuable 
insights  into  the  nature  of  the  non-covalent  interactions  these  drug  molecules  are  likely 
to  be  involved  in,  and  the  geometric  and  the  electronic  consequences  of  such 
interactions.  These  insights  are  valuable  in  elucidating  the  molecular  mechanism  of 
action  of  aspirin-like  drugs  which  are  believed  to  exert  their  pharmacological  effects 
through  the  inhibition  of  prostaglandin  biosynthesis. 
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Structural  changes  in  the  monomeric  despentapeptide  (B30-B26) 
insulin  crystal 
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Abstract.  Insulin  modified  by  the  removal  of  its  5  B  chain  C  terminal  residues  is  monomeric 
but  remains  substantially  potent.  The  crystal  structures  of  the  beef  and  insulin  (dpi  )  with  two 
molecules  in  the  asymmetric  unit  has  been  determined  by  x-ray  analysis.  The  3-dimensional 
structure  of  dpi  proves  to  be  generally  similar  to  that  of  native  molecule  in  2  Zn  insulin.  More 
detailed  comparison  reveals  that  the  slight  differences  in  the  two  independent  molecules  of  beef 
DPI  are  distributed  uniformly  throughout  the  structure  in  contrast  to  insulin  in  2  Zn  insulin, 
where  the  structural  changes  are  concentrated  in  specific  regions. 

The  loss  of  symmetry  in  the  dpi  crystal  appears  to  be  the  inability  of  the  A9  serine  to  pack 
effectively  in  the  C2  cell.  The  efficient  packing  of  the  sheep  dpi  molecule  whose  crystal  structure 
has  also  been  determined  and  where  A9  is  glycine  supports  this  conclusion. 

Keywords.  Insulin;  despentapeptide  insulin;  crystal  structure;  monomer. 


1.  Introduction 

Insulin  molecule  has  been  at  the  centre  of  biochemical  research  ever  since  its  discovery 
in  the  early  1920s.  Its  medical  importance  in  controlling  sugar  levels  in  the  blood  and 
alleviating  diabetes  has  stimulated  study  of  its  chemical  properties,  its  behaviour  in 
solution  and  its  biological  effects.  The  molecule’s  sequence,  which  was  determined  by 
Sanger  and  his  colleagues,  provided  the  first  proper  chemical  description  of  proteins 
(Ryle  et  al  1955).  Research  into  the  hormone’s  crystal  structure  was  begun  by  Dorothy 
Hodgkin  (then  Crowfoot)  in  1935  and  consequently  insulin  crystals  were  the  subject  of 
the  earliest  protein  crystallographic  investigations  (Hodgkin  and  Riley  1968). 

A  number  of  experimental  techniques  through  which  the  phase  could  be  derived  were 
investigated  over  the  years.  One  very  important  development  in  which  Professor  S. 
Ramaseshan  played  a  leading  part,  was  the  exploitation  of  anomolous  scattering 
differences  produced  by  heavy  atoms  introduced  into  the  crystal  structure 
(Ramaseshan  and  Narayan  1981). 

The  3-dimensional  structure  of  the  insulin  molecule  was  first  determined  in  1969 
(Adams  et  al  1969).  In  the  2  Zn  insulin  crystal,  the  molecule’s  A  chain  of  21  residues  was 
found  to  fold  compactly  with  an  N  terminal  helix  extending  from  A^-Ag  running 
adjacent  and  antiparallel  to  a  C  terminal  helix  between  A12-A21.  The  B  chain  of  30 
residues  was  seen  to  contain  a  central  section  of  helix  between  B9-B19;  from  each  end 
of  this  the  N  and  C  terminal  arms  extended  around  the  A  chain. 
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In  2  Zn  insulin  the  hormone  is  organized  as  a  hexamer.  Since  1969  the  crystal 
structures  of  another  hexamer,  4  Zn  insulin  and  a  dimeric  insulin  (Cutfield  et  al  1981) 
and,  most  recently,  of  a  modified  monomeric  insulin  have  been  established. 
Crystallographic  studies  on  normal  insulins  are  restricted  to  aggregated  forms  of  the 
molecule  owing  to  its  property  of  self-assembly  to  dimer  and  hexamers  at  low 
concentrations.  The  obstacle  to  the  study  of  the  insulin  monomer  presented  by  its 
property  of  self-assembly  has  been  overcome  by  the  preparation  and  crystallisation  of  a 
modified  molecule  from  which  the  5  B  chain  C  terminal  residues  (B26-B30)  have  been 
removed.  These  5  residues  complete  the  surface  which  is  buried  by  dimer  formation  and 
provide  a  complementary  pair  of  H  bonds  to  the  partner  molecule.  Their  removal 
abolishes  the  molecule’s  capacity  to  dimerise  and  to  hexamerise  but  leaves  it  with 
substantial  potency  (Shanghai  Insulin  Research  Group  1976).  The  Chinese  in  their 
systematic  analyses  of  those  regions  of  insulin  have  studied  the  pig  dpi  molecules  in 
solution  and  in  the  crystal  co-crystallised  with  cadmium  ions.  Comparison  between  the 
pig  and  beef  and  sheep  dpi  crystal  structures  shows  their  conformations  are  generally 
very  similar  but  there  are  significant  differences  which  will  need  to  be  examined  (D 
Stuart  and  D  C  Liang,  private  communication). 

The  structure  of  the  monomer  is  of  great  interest  for  three  principal  reasons.  First, 
the  hormone  is  active  as  a  monomer.  Secondly,  insulin’s  aggregation  could  bring  about 
some  rearrangements  of  the  structure  which  in  turn  could  affect  our  ideas  of  the 
relationship  between  the  hormone’s  structure  and  activity.  Thirdly  the  molecule 
possesses  considerable  flexibility  when  organised  as  a  hexameric  structure.  Since  there 
is  evidence  that  conformational  flexibility  is  important  in  the  expression  of  the 
hormone’s  biological  activity,  crystallographic  study  of  the  monomer  free  of  the 
constraints  imposed  by  aggregation  should  give  an  insight  into  its  structural  and 
conformational  behaviour  relevant  to  its  biological  action.  It  has  emerged  that  the 
mechanism  by  which  the  molecule  adjusts  is  by  relative  movements  of  its  helical 
structures;  these  behave  as  essentially  rigid  entities,  linked  by  more  flexible  extended 
pieces  of  polypeptide  chain  (Chothia  et  al  1983).  This  behaviour  explains  nicely  the 
structural  changes  seen  in  the  insulin  molecules  in  the  insulin  hexamers  where  the 
differences  are  associated  with  the  crystal-packing  requirements  forcing  one  molecule 
to  alter.  The  despentapeptide  insulin  molecule  provides  a  further  and  similar  example 
of  this  phenomenon  which  will  be  the  subject  of  this  paper. 

2.  Experimental 

2.1  Preparation  and  crystallisation  of  dpi 

The  DPI  from  beef  and  sheep  insulin  was  prepared  by  controlled  cleavage  at  the 
B25-B26  peptide  bond  using  pepsin  at  4°C  according  to  the  procedure  described  by 
Gattner  (1975).  The  product  was  purified  by  column  chromatography  to  homogeneity. 
The  beef  and  sheep  dpi  samples  was  crystallized  using  citrate/tris  buffer  with 
(NH4)2S04  as  a  precipitant  (Bi  et  al  1983). 

Several  crystal  forms  of  beef  and  sheep  dpi  have  been  obtained  (Bi  et  al  1983)  but  only 
two  have  been  refined.  Their  main  crystallographic  details  are  given  in  table  1.  The 
structures  were  initially  determined  by  the  molecular  replacement  method  and 
subsequently  refined  by  using  the  Agarwal  (1978)  fast  Fourier  least  squares  procedure. 
Because  the  resolution  of  the  x-ray  diffraction  data  in  these  studies  is  limited,  the 
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Table  1.  Crystallographic  details  of  beef  and  sheep  dpi. 


Space 

Molecule  in 

X-ray  data 

group 

a 

b 

c 

P  ass.  unit 

resolution 

R 

Beef  DPI 

C2 

52-7 

261 

51-6 

93-4  2 

1-5  A 

17-0% 

Sheep  DPI 

C2 

53-9 

25-9 

25-6 

93-8  1 

2-1  A 

21-2% 

calculated  shifts  are  inaccurate  and  consequently  the  atomic  positions  have  been 
constrained  to  conform  to  proper  peptide  geometry  after  each  cycle  (Dodson  et  al 
1976). 

The  refinement  of  the  beef  dpi  is  now  complete  and  the  well-defined  atoms  are  placed 
with  an  accuracy  between  0  05  and  OT  A.  Inevitably  more  poorly  ordered  atoms 
(including  of  course  many  of  the  solvent  water  molecules)  are  less  accurately  positioned; 
their  errors  range  between  0- 2-0-3  A.  The  sheep  dpi  refinement  is  not  yet  finished  but 
the  well-defined  protein  atoms  are  nonetheless  determined  with  sufficient  accuracy  for 
useful  comparison  with  the  beef  dpi  molecule. 

2.2  Comparison  of  beef  dpi  and  sheep  dpi  crystal  organisation 

In  figures  1  (a)-(d)  the  packing  of  the  beef  and  sheep  dpi  molecules  in  their  unit  cells 
shows  how  one  of  the  crystal  axes  (labelled  A)  in  the  sheep  dpi  crystal  is  not  exactly 
obeyed  in  the  beef  dpi  crystal.  The  reason  for  this  is  clear.  At  A9  beef  dpi  has  a  serine 
whose  sidechains  lie  very  near  the  now  approximate  axis  of  two  fold  symmetry  (A) 
relating  the  molecules.  In  figure  I  (c)  two  sheep  dpi  molecules  related  by  the  operation 
of  the  exact  two  fold  axis  (A)  are  shown  in  a  view  perpendicular  to  the  axis.  The  two  beef 
DPI  molecules  related  by  the  now  pseudo  axis  (A)  in  their  unit  cell  are  illustrated  in  figure 
1  (d)  in  a  view  equivalent  to  that  in  figure  1  (c).  The  mismatching  illustrated  in  the  figures 
shows  that  in  order  to  accommodate  the  bulk  of  the  A9  serine  on  the  axis,  the  equivalent 
structures  in  the  two  beef  dpi  molecules  are  displaced  by  about  4  A  along  the  direction 
of  the  b  axis. 

Inspection  of  the  approximate  diad  A  in  figure  1  (b)  shows  that  the  overall  movement 
which  is  produced  by  the  need  to  pack  the  A9  serines  is  however  accompanied  by 
practically  no  change  in  the  conformation  at  A9  and  only  limited  changes  at  A8  and 
A 10.  The  ability  of  the  seryl  sidechain  to  resist  the  packing  pressures  is,  in  part  at  least,  a 
consequence  of  the  well-defined  H  bond  between  the  A9  seryl  OG  and  A 10  NH,  an 
interaction  which  is  also  present  in  2  Zn  insulin,  together  with  the  inherent  stability  of 
the  helical  organisation.  In  figure  2  the  two  beef  dpi  molecules  related  by  the  pseudo 
diad  are  overlapped,  giving  an  impression  of  the  internal  changes  in  the  molecules 
associated  with  the  loss  of  symmetry. 

2.3  Comparison  between  the  structures  of  2  Zn  insulin  and  beef  and  sheep  dpi 

The  comparison  between  the  insulin  molecule  in  2  Zn  insulin  and  the  dpi  molecules 
illustrated  in  figure  3  shows  that  they  are  all  generally  similar  except  at  the  B  chain  N 
terminal  residues  B1-B4.  These  residues  extend  away  from  the  molecule  in  dpi  in 
contrast  to  the  situation  in  2  Zn  insulin,  where  they  wrap  around  the  A  chain.  There  are 
also  significant  structural  changes  at  B25  phenylalanine,  a  C  terminal  residue  in  dpi  .  In 
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B  CHAIN 


Figure  1.  Crystal  packing  of  the  dpi  molecules  shown  as  mainchain  for  all  residues  except 
A9  which  in  sheep  dpi  is  glycine  and  beef  dpi  is  serine.  The  two-fold  axis  in  the  sheep  dpi 
are  labelled  A  &  B;  B  remains  a  crystal  axis  in  beef  dpi  while  its  two-fold  symmetry  is  broken, 
(a)  Sheep  dpi.  Note  the  water  on  the  axis  at  A,  H  bonded  to  the  2-fold  related  AlO  NH  groups. 
The  A  &  B  chains  of  the  two  2-fold  related  molecules  are  indicated;  A9  is  labelled  9. 


DPI  there  are  no  contacts  between  B25  and  the  rest  of  the  molecule;  in  2  Zn  insulin  the 
B25  residue  makes  a  well-defined  H  bond  interaction  to  A19  O  (Dodson  et  al  1979). 

More  detailed  analyses  (table  2)  reveals  that  the  dpi  molecules  from  beef  and  sheep 
are  much  nearer  to  the  molecule  1  structure  (Chothia  et  al  1983)  in  2  Zn  insulin, 
although  there  is  some  relative  movement  in  the  A  and  B  chains  (see  table  2).  This 
molecule’s  structure  also  occurs  in  4  Zn  insulin  and  in  the  dimeric  pig  and  hagfish 
insulin  crystals  (Cutfield  et  al  1981;  Reynolds  and  McCall  1983).  For  this  reason  and 
because  of  its  larger  number  of  internal  H  bonds,  molecule  1  has  been  regarded  as  likely 
to  be  near  the  structure  of  the  monomer,  a  proposal  that  the  dpi  analyses  have  quite 
confirmed. 

Figure  4(a)  shows  the  overlap  of  the  two  beef  dpi  molecules  optimised  on  the  B  chain 
helix  structure.  It  can  be  seen  that  while  the  B  chain  helices  are  very  similar  the  A  chain 
positioning  relative  to  the  B  chain  is  evidently  different  in  the  two  molecules,  indicating 
there  is  some  movement  of  the  two  chains  (rms  A  =  0-63  A).  When  the  helical  segments 
A1-A9  and  A12-A20  are  each  optimally  overlaid,  their  structural  similarity  becomes 
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Figure  1.  (Contd.) 

(b)  Beef  dpi.  Note  the  close  approach  by  the  A9  seryl  sidechains  to  the  line  of  the  local  axis  A. 
The  A  &  B  chains  of  the  two  molecules  related  by  the  local  axis  at  A  are  indicated. 


apparent  from  figures  4(b)  and  (c)  and  from  the  magnitudes  of  the  rms  discrepancies 
between  the  matched  structures  (table  2).  The  N  terminal  B  chain  residues  B1-B5 
correlate  better  with  the  A  chain  helical  segments  than  with  the  B9-B19  helix.  The 
disulphide  bond  at  A7-B7  between  the  chains  is  presumably  responsible  for  this  linked 
behaviour.  By  contrast  the  structural  changes  in  insulin  in  2  Zn  insulin,  shown  in  figure 
4(d),  are  focussed  at  A1-A8  and  A19  &  B25  sidechains. 

This  picture  of  roughly  equal  distribution  of  structural  movement  in  the  helices  in  the 
two  beef  dpi  molecules  by  which  they  have  adjusted  to  their  crystal  environment  is 
confirmed  by  comparison  with  the  sheep  dpi.  As  mentioned  earlier  this  molecule 
crystallises  with  only  one  molecule  in  the  asymmetric  unit  permitted  because  its  A9 
residue  is  glycine  which,  having  no  side  chain,  allows  symmetrical  packing  about  the 
crystallographic  two  fold  axis  (see  figure  la).  Analysis  of  the  two  independent  beef  dpi 
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Figure  1.  (Conld.) 

(c)  Sheep  dpi.  The  two  molecules  related  by  A  are  viewed  perpendicular  to  the  axis.  The  water 
on  the  axis  A  between  the  two  A 10  NH  groups  is  indicated. 


molecules  listed  in  table  2  shows  that  they  are  almost  equally  different  from  the  sheep 
DPI .  There  is  however  a  significantly  larger  difference  in  the  two  beef  and  the  sheep  dpi 
molecules  at  the  A1-A8  helix.  In  part  this  is  because  the  Al  residue  in  sheep  dpi  is  still 
not  satisfactorily  determined;  there  is  furthermore  rather  low  electron  density  at  the 
residues  A1-A4  suggesting  they  are  less  well-ordered  than  the  subsequent  structure. 
Other  differences  however  occur  at  A5-A10  which  apparently  indicate  this  helix  does 
alter  in  its  internal  structure  unlike  the  others.  There  is  on  the  other  hand  an  important 
structural  difference;  the  loss  of  the  A9  serine  in  sheep  insulin  changes  the  main  chain  H 
bonding  pattern  (between  A90G  and  AlONH).  Because  the  AlONH  no  longer 
interacts  with  a  serine  OG  (from  A9  in  beef  dpi)  it  makes  H  bonds  to  a  water  molecule 
trapped  on  the  local  axis  (A)  between  a  pair  of  two  fold  related  molecules  (figure  lb). 
The  adjustment  of  the  peptide  conformation  needed  to  make  this  interaction  then  alters 
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Figure  1.  (Contd.) 

(d)  Beef  dpi  viewed  in  the  same  direction  as  (c).The  two  molecules  are  no  longer  exactly  related 
by  an  axis  at  A;  the  effect  of  the  A9  serine  sidechain  is  a  displacement  between  the  two 
molecules  of  about  4A  in  the  direction  of  the  local  axis. 


the  mainchain  H  bond  contacts  between  A90  and  B5  imidazole  ND  and  the  A8NH  and 
A30  disturbing  the  structure  of  the  helix. 

3.  Conclusion 

The  loss  of  two  fold  symmetry  in  the  2  Zn  insulin  hexamer  is  associated  with  the 
packing  of  the  B5  histidines  in  adjacent  hexamers  stacked  along  the  three-fold  axis. 
These  would  intersect  in  the  2  Zn  insulin  crystal  organisation  if  the  molecules  both  had 
the  conformation  seen  in  molecule  1  (Cutfield  et  al  1981),  It  appears  that  in  concert  with 
structural  changes  at  the  A  chain  N  terminals  and  at  B28-B30  the  two  opposing  B5 
histidines  can  make  satisfactory  contacts  in  the  2  Zn  insulin  crystals  but  they  loose  two- 
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Figure  2.  The  overlap  of  the  two  beef  dpi  mainchain  structures  when  related  by  the  local 
diad  A.  The  A  and  B  chains  are  labelled.  Note  the  movement  in  the  A  chain  N  terminal  helix. 


MONOMERIC 

INSULIN 


2Zn  MOLECULE  1 


HEXAMERIC 

INSULIN 


Figure  3.  The  pig  2  Zn  insulin  and  beef  dpi  molecules  all  shown  in  an  equivalent  view  which 
is  perpendicular  to  the  direction  of  the  local  axis  in  2  Zn  insulin.  There  are  two  crystallographi- 
cally  independent  pig  insulin  molecules  in  2  Zn  insulin  and  two  independent  beef  dpi 
molecules  in  the  C2  beef  dpi  crystal,  and  in  each  case  both  structures  are  shown.  Only  selected 
sidechains  are  included  to  simplify  the  figure. 

The  general  similarity  in  the  chain  folding  of  the  pig  2  Zn  insulin  and  the  dpi  molecules 
is  clear;  the  changes  in  structure  at  A1-A9  in  2  Zn  pig  insulin  (molecule  1)  are  evident. 


Crystal  structure  of  despentapeptide  insulin 


191 


Table  2.  The  rms  discrepancies  between  the  helical  segments  in  Beef  and 
sheep  DPI  and  2  Zn  insulin. 


Insulin 

Helical 

Segments 

Beef 
DPI  (1) 

Beef 
DPI  (2) 

Sheep 

DPI 

2Zn  insulin 
molecule  1 

2Zn  insulin 
molecule  2 

B9-B19 

0-29 

0-29 

0-24 

024 

beef  DPI  (1) 

A1-A8 

0-31 

0-56* 

0-27 

0-94 

A12-A20 

0-27 

0-36 

0-24 

0-41 

All 

0-63 

0-43* 

0-88 

104 

B9-B19 

0-33 

0-24 

0-27 

beef  DPI  (2) 

A1-A8 

0-60* 

0-26 

115 

A12-A20 

0-36 

0-24 

0-28 

All 

0-52* 

0-61 

1-27 

B9-B19 

0-38 

0-38 

sheep  DPI 

A1-A8 

0-59* 

l-28» 

A12-A20 

0-42 

0-44 

All 

0-77* 

1  09* 

2Zn  insulin 

B9-B19 

012 

molecule  1 

A1-A8 

MO 

A12-A20 

015 

All 

0-46t 

*  These  overlapped  segments  include  residues  A2-A8  only  owing  to  the  poor  definition  of  Al. 
t  These  overlapped  segments  include  residues  A12-A20  and  B9-B19  only  owing  to  the  large  structural 
differences  at  A1-A5,  B1-B8  and  B27-B30. 


A8.A9  A8.A9 


Figure  4.  The  overlap  of  the  helical  segments  in  the  two  beef  dpi  molecules  in  the  C2  crystal  assymmetric  unit 
illustrating  their  close  structural  similarity.  Only  some  selected  sidechains  are  shown,  (a)  B9-B19.  (b)  A1-A8. 
(c)  A12-A20.  (d)  The  overlap  of  the  two  insulin  molecules  in  2  Zn  pig  insulin  carried  out  by  matching  the 
mainchain  between  A10-A20  &  B1-B27.  There  is  a  close  similarity  in  much  of  the  two  molecules’  structure, 
except  at  A1-A9  and  B5  and  at  tyrosyl  &  phenylalanyl  sidechains  A19  &  B25.  More  selected  sidechains  are 
included  than  in  a,  b  &  c  in  order  to  illustrate  that  they  also  possess  close  similarity  in  conformation. 
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Local  2-fold  axis  between  hexamers 


Figure  5.  The  crystal  packing  around  the  B5  histidines  in  pig  2  Zn  insulin.  In  accommodat¬ 
ing  to  the  packing  requirements  the  histidine  of  molecule  2  has  changed  its  conformation  and 
makes  different  internal  contacts  to  the  B5  histidine  in  molecule  1  and  no  contacts  with  the 
adjacent  molecule.  The  histidine  of  molecule  2  makes  2  well-defined  internal  H  bonds  (to  A7 
and  A9  carbonyl  O),  and  a  good  inter  hexamer  H  bond  to  A90  which  has  allowed  it  to  retain  its 
original  conformation  while  its  companion  alters  in  structure. 


fold  symmetry  in  doing  so  (figure  5).  There  are  obvious  parallels  between  the  situation 
in  2  Zn  insulin  and  the  beef  dpi  crystals  with  its  packing  together  of  the  A9  serines.  But 
the  structural  response  of  the  molecules  in  the  dpi  crystal  is  quite  different.  The  2  Zn 
insulin  changes  of  structure  are  large  but  are  limited  essentially  to  A1-A5  and  B28-B30 
(&  B25  sidechains).  The  2Zn  insulin  hexamer  probably  contains  exact  two-fold 
symmetry  in  solution;  the  perturbations  needed  to  allow  satisfactory  crystal  packing 
have  to  overcome  a  pre-existing  symmetrical  structure.  It  thus  appears  that  the 
integration  of  the  hexamer  prevents  much  deviation  from  symmetry  except  on  the 
surface  where  the  adjustments  are  relatively  large.  Those  within  the  hexamer  are  much 
smaller,  of  the  order  0-2-0-3  A  and  the  segments  themselves  change  very  little,  by 
between  OT  and  0-2  A.  The  absence  of  structural  change  at  A9  serine  and  the  structural 
change  in  the  beef  dpi  molecules  in  response  to  the  requirements  of  crystal  packing 
illustrates  the  expected  greater  flexibility  of  a  monomeric  insulin  species.  This  property, 
however,  derives  mainly  from  the  ability  of  the  various  segments  of  secondary 
structure,  such  as  the  helices,  to  move  relative  to  one  another  through  the  flexibility  of 
the  extended  pieces  of  main  chain  that  link  them  (Chothia  et  al  1983). 
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X-ray  structural  studies  on  metal  complexes  with  nucleotides  and 
pyridoxal-amino  acid  Schiff  bases.  Models  for  platinum  binding  to 
DNA  and  pyridoxal  catalyzed  reactions 

M  DAMODARA  POOJARY,  S  P  SUDHAKARA  RAO  and 
H  MANOHAR 

Department  of  Inorganic  and  Physical  Chemistry,  Indian  Institute  of  Science,  Bangalore 
560012,  India 

Abstract.  X-ray  structural  studies  on  metal  complexes  with  nucleotides  and  with  pyridoxal- 
amino  acid  Schiff  bases  are  briefly  reviewed.  The  results  with  ternary  metal  nucleotide 
complexes  show  that  the  oxopurine  nucleotides  coordinate  to  the  metal  ion  through  the  N(7) 
atoms  of  the  bases  in  cis  position.  The  relevance  of  this  mode  of  binding  is  discussed  in  terms  of 
the  possible  mechanism  of  action  of  the  novel  platinum  drugs.  On  the  basis  of  the  studies  on 
metal  pyridoxal-amino  acid  Schiff  base  complexes,  the  variations  in  stereochemistry  of  the 
ligands  in  different  metal  complexes  have  been  related  to  the  catalytic  activity  of  various  metal 
ions  in  pyridoxal-catalyzed  nonenzymatic  reactions. 

Keywords.  X-ray  crystallography;  metal-nucleotide  complexes;  metal-Schiff  base  complexes; 
pyridoxal-amino  acid  Schiff  bases;  Pt(II)-DNA  interaction. 


1.  Introduction 

Metal  ion  complex  formation  is  one  of  the  prominent  interactions  in  biological  systems. 
Groups  like  nucleotides  and  pyridoxal-amino  acid  Schiff  bases  are  important  biological 
entities  and  their  interactions  with  metal  ions  are  a  fertile  field  of  research  activity.  We 
have  initiated  work  on  structural  studies  of  ternary  metal  nucleotide  complexes  and 
metal  complexes  with  pyridoxal-amino  acid  Schiff  bases.  In  this  paper,  our  recent 
results,  as  well  as  earlier  work  in  this  area,  are  briefly  reviewed. 


2.  Models  for  platinum  binding  to  dna 

2.1  Role  of  metal  ions  in  the  biochemistry  of  nucleic  acids 

It  is  well-known  that  metal  ions  play  an  important  role  in  the  biochemistry  of  nucleic 
acids  (Eichhorn  1973).  For  example,  the  transfer  of  atp  phosphoryl,  which  is  an 
enzymatic  process  essentially  requires  the  presence  of  metal  ions.  Even  the  non¬ 
enzymatic  transphosphorylation,  which  occurs  between  atp  and  the  orthophosphate 
ion,  is  catalysed  by  metal  ions,  notably  Mn^'^,  Cd^"^,  Ca^^.  Similarly  the  cleavage  of 
pyrophosphate  from  four  deoxyribonucleotide  triphosphates,  dATP,  dcTP,  dcTP,  dTTP, 
involved  in  the  dna -polymerase  reaction,  together  with  the  formation  of  phos- 
phodiester  linkage  requires  the  presence  of  divalent  metal  ions,  mainly  Mg^'*".  It  was 
also  discovered  that  the  transition  metal  ions  like  Fe^"^  and  Cu^'^  are  components  of 
tobacco  mosaic  virus  dna  ,  and  they  are  strongly  bound  to  the  ligating  sites  in  the  nucleic 
acid.  More  recent  investigations  have  shown  that  Zn^^  is  essential  for  the  activity  of 
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Escherchia  coli  polymerase  1.  Moreover  the  in  vivo  role  of  metal  ions  in  nucleic  acid 
chemistry — ^dna  replication,  transcription  and  translation;  dna  denaturation  and 
renaturation;  rna  conformational  properties;  depolymerization  of  rna;  enzyme-metal- 
nucleic  acid  ternary  species  has  been  well  recognised  (Eichhorn  1973).  Furthermore,  the 
clinical  success  of  certain  transition  metal  complexes,  e.g.  cis-[Pt(NH3)2Cl2] 
(Rosenberg  et  al  1969)  and  the  speculation  that  these  complexes  may  act  by  crosslinking 
DNA  in  vivo  has  promoted  research  activity  in  this  area.  Also  the  unique  properties  of 
metals  are  used  as  probes  for  nucleic  acid  chemistry,  for  structural  investigations  and 
for  isolation  of  these  biologically  important  molecules  (Eichhorn  1973;  Marzilli  1977; 
Hodgson  1977). 

Although  the  importance  of  metal  ions  in  nucleic  acid  processes  is  well  estabhshed  their 
mechanism  of  action  is  little  understood.  As  these  processes  are  selective  in  nature,  the 
interaction  of  inorganic  species  with  nucleic  acids  has  also  to  take  place  through  selective 
coordination.  The  polynucleotides  offer  three  major  ligating  sites  to  metal  ions:  (i)  the 
heterocyclic  ring  nitrogen  atoms,  especially  N(l)  and  N(7)  of  purines  and  N(3)  of 
pyrimidines,  and  the  exocyclic  functional  groups  of  purine  and  pyrimidine  bases;  (ii)  the 
phosphate  oxygens  of  phosphodiester  linkages;  (iii)  the  ribose  moiety.  A  knowledge  of 
the  metal  ion  binding  sites  for  each  of  these  component  nucleotides  (chart  I),  therefore, 
is  of  great  value  in  assessing  how  the  metal  ions  might  interact  with  nucleic  acids. 

This  led  several  workers  to  investigate  metal  complexes  of  nucleic  acids  and  their 


a)  Adenosine 

b)  R=NH2  Ganosine 
R=H,  Inosine 

f)  Deoxyribonucleotide 


c)  Ribonucleotide 

d)  Cytidine 

e)  R'=H,  Uridine 
R'=CH3,  Thymidine 


Chart  1. 
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constituents:  nucleotides,  nucleosides  and  bases,  while  earlier  studies  in  this  direction 
carried  out  in  solution  and  using  spectroscopic  methods  are  significant,  more 
experimental  data  are  necessary  to  determine  with  certainty  the  mode  of  binding  of 
metals  and  their  effects  on  the  structure  and  conformations  of  nucleic  acids.  Among  the 
various  physical  techniques  suitable  for  studying  substances  in  the  crystalline  state, 
x-ray  diffraction  has  proved  to  be  a  powerful  tool  to  elucidate  the  structure  of 
complexes  formed  by  transition  metal  ions  with  nucleic  acid  constituents.  A  great 
wealth  of  information  regarding  binding  sites,  conformations,  packing,  hydrogen 
bonding,  etc.  has  been  accumulated  in  nucleic  acid  constituents  in  their  complexes 
(Hodgson  1977;  Marzilli  1977;  Marzilli  and  Kistenmacher  1977;  Gellert  and  Bau  1979; 
Swaminathan  and  Sundaralingam  1979;  Aoki  1981). 

As  mentioned  earlier,  the  mixed  ligand  complexes  have  been  recognised  as  the  low- 
molecular  weight  representatives  for  the  active  catalytic  centres  occurring  in  vivo.  A 
ternary  complex  involving  a  metal  ion,  a  nucleotide  and  a  secondary  ligand  such  as  2-2'- 
bipyridine  can  mimic  enzyme-metal  ion-substrate  interactions  in  biological  systems 
(Naumann  and  Sigel  1974).  Moreover  it  has  been  suggested  that  in  solution,  in  the 
presence  of  a  7r-aromatic  amine,  metal-nucleotide  interaction  takes  place  predomi¬ 
nantly  through  phosphate  oxygens,  while  the  base  of  the  nucleotide  does  not  take  part 
in  coordination  (Naumann  and  Sigel  1974;  Sigel  1975;  Mitchell  and  Sigel  1978).  This 
observation  is  attributed  to  the  discriminating  qualities  of  7i-aromatic  amines  to  the 
metal  ions,  which  in  turn  is  one  of  the  tools  employed  by  nature  to  achieve  selectivity. 
Hence  one  is  tempted  to  predict  that  much  of  the  specificity  and  many  of  the  properties 
so  far  only  associated  with  metal  ion  complexation  in  macromolecular  biological 
systems  are  inherent  already  in  relatively  simple  ternary  systems. 

On  the  other  hand,  an  entirely  different  binding  mode  has  been  observed  in  some 
ternary  platinum  oxopurine  nucleotide  complexes  (Aoki  1981).  The  nucleotides 
coordinate  to  the  metal  ion  in  cis  position  through  the  N(7)  atoms  of  the  purine  bases. 
Such  a  type  of  binding  has  been  observed  (Poojary  and  Manohar  1982, 1983)  in  ternary 
Co^"^,  Ni^'^  and  Cu^^  oxopurine  complexes.  The  importance  of  these  compounds  as 
models  for  platinum  binding  to  dna  is  discussed  below. 

2.2  Anti-cancer  activity  of  cis-oDP 

Since  the  discovery  of  the  anti-tumour  activity  of  cis-  [Pt(NH3),CI,]  (ciS-DDP) 
(Rosenberg  et  al  1969),  considerable  research  has  been  focussed  on  elucidating  the 
mechanism  of  activity  of  these  drugs.  The  following  common  features  which  link 
structure  to  reactivity  have  emerged  from  these  studies  (Rosenberg  1980):  (i)  The 
complexes  have  both  inert  and  labile  ligands  and  only  the  latter  ligands  undergo 
labilization  quickly  in  reactions  with  biological  molecules,  (ii)  The  exchangeable 
leaving  groups  should  be  in  the  cis  configuration,  (iii)  The  complexes  should  be  neutral, 
(iv)  The  geometry  of  the  complexes  is  either  square  planar  or  octahedral,  (v)  Two  cis 
monodentate  or  one  bidentate  leaving  groups  are  required,  (vi)  The  leaving  ability  of 
these  labile  groups  should  be  moderate,  (vii)  The  leaving  groups  should  be  separated 
approximately  by  3-4  A. 

Studies  on  cis-ddp  and  its  analogues  clearly  suggest  that  activity  of  these  drugs 
depends  on  the  cis  oriented  labile  ligands  in  the  coordination  plane  of  platinum.  In 
aqueous  solution  these  labile  ligands  may  be  easily  displaced,  and  it  has  been  speculated 
that  the  resulting  cis  (NH3)2Pt^^  moiety  is  the  active  species  which  interacts  with 
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biological  target  molecule  (Barton  and  Lippard  1980).  Studies  have  also  shown  dna  as 
cellular  target  for  cis  ddp.  In  binding  to  dna,  cis  ddp  could  block  replication,  which 
would  suppress  rapid  growth  of  cancerous  tissues. 

2.3.  Possible  mechanisms  of  action  of  cis-DDP 

A  variety  of  structural  models  for  the  ligating  site  in  dna  for  cis  ddp  have  been 
proposed.  Studies  have  shown  that  N(7)  nitrogen  atom  of  purine  and  N(3)  nitrogen 
atom  of  pyrimidines  are  the  primary  sites  of  coordination  for  the  platinum  atom. 
However,  this  unidentate  coordination  is  not  sufficient  to  explain  the  observed 
differences  in  activity  of  cis  and  trans  isomers.  Potentiometric  studies  (Macquet  and 
Theophanides  1975)  show  that  on  binding  of  cis  ddp  to  dna  both  the  chlorine  ions  are 
released,  while  the  release  of  one  chlorine  ion  is  observed  on  binding  of  trans  isomer. 
Thus  N(7)-0(6)  chelation  was  proposed  on  the  basis  of  bifunctional  mode  of 
coordination.  The  trans  isomer  could  not  bind  to  these  sites  simultaneously.  However, 
geometrical  considerations  required  for  such  chelation  have  led  many  to  suggest  that 
such  a  structure  is  unlikely. 

Other  possible  bifunctional  coordination  modes  include  the  formation  of  either  intra 
or  interstrand  crosslinks.  The  crosslinking  of  base  residues  on  opposite  strands  of  dna 
would  severely  hinder  replication.  Even  though  interstrand  crosslinking  of  dna  does 
occur  in  the  presence  of  cis  ddp,  the  frequency  of  this  event  is  too  low  to  account  for  its 
antitumour  activity  (Roberts  and  Thomson  1979).  At  present  it  appears  more  probable 
that  the  site  of  interaction  involves  an  intrastrand  crosslinking.  The  observation  that 
the  nonbonded  distance  between  the  chlorine  ions  in  cis  ddp  is  34  A,  the  base  pair 
stacking  distance  suggests  that  intrastrand  crosslinking  is  chemically  more  reasonable. 
Such  a  crosslink  would  necessarily  result  in  an  unstacking  of  bases.  The  base  tilting 
required  to  accomplish  this  crosslink  would  lead  to  local  denaturation.  This  intrastrand 
crosslink  could  be  formed  by  platinum  binding  to  N(7)  nitrogen  atoms  of  the  adjacent 
guanine  bases. 

2.4  Intrastrand  crosslinking  models 

Even  though  information  about  the  binding  of  cis  ddp  to  the  polymer  is  scarce,  some 
recent  studies  on  monomers  allow  interesting  speculations  to  be  made.  One  increas¬ 
ingly  mentioned  mode  of  binding  to  the  cis  isomer  is  intrastrand  crosslinking. 
Goodgame  et  al  (1975)  described  a  c/s-Pt(NH3)2"^  complex  of  5'-imp,  cis-[Pt(5'- 
imp)2(NH3)2]^“,  wherein  Pt(II)  binds  through  equatorial  sites  to  two  5'-imp  ligands 
which  are  related  by  a  crystallographic  two-fold  axis.  Similar  type  of  complexes  were 
reported  with  different  inert  ligands  like  ethylenediamine  and  trimethylenediamine 
(Kistenmacher  et  al  1978;  Bau  and  Gellert  1978;  Kistenmacher  et  al  1979;  Marzilli  et  al 
1 980).  Most  of  these  complexes  are  nonstoichiometric.  All  these  structures  bear  a  strong 
resemblance  to  the  structure  of  monosodium  salt  of  5'-imp  (Rao  and  Sundaralingam 
1969),  where  a  water  molecule  lies  on  a  crystallographic  2-fold  axis  and  links,  via 
hydrogen  bonds,  two  symmetry-related  nucleotides.  In  the  isomorphous  platinum 
complexes  this  water  site  is  occupied  by  the  Pt(II)  moiety  and  the  hydrogen  bonding 
scheme  present  in  the  sodium  salt  is  replaced  by  two  cis  coordination  bonds.  In  terms  of 
the  possible  effect  on  a  polynucleotide  structure  due  to  intrastrand  crosslinking,  the 
structural  similarities  displayed  by  these  5'-imp  salts  are  particularly  important.  Chiang 
et  al  (1978)  showed  that  there  is  a  strong  competition  between  the  crystal  packing  forces 
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operative  in  the  solid  and  the  distortion  of  the  basic  structure  due  to  the  simultaneous 
binding  of  the  two  5'-imp  bases  by  Pt(II)  moiety.  In  fact  if  one  assumes  that  the  crystal 
packing  forces  operative  in  the  5'-imp  complexes  approximate  those  found  in  the  base- 
stacked  polynucleotide,  the  above  result  suggests  that  intrastrand  crosslinking  by 
reagents  such  as  cis  ddp  will  place  a  significant  strain  on  the  conformational  properties 
of  the  polynucleotide. 

All  the  platinum-5'-iMP  complexes  discussed  above  are  1 : 2  complexes.  The  binding 
of  metal  to  the  oxopurine  nucleotides  in  cis  position  has  also  been  displayed  by  1 : 1 
ternary  metal-nucleotide  complexes  containing  ethylenediamine  and  metal  ions  like 
Co(III)  and  Ni(II)  (Poojary  and  Manohar  1982).  Again  these  complexes  are  isomor- 
phous  among  themselves  and  isostructural  with  platinum  complexes  and  sodium  salts 
of  oxopurine  nucleotides.  For  comparison,  structures  of  the  sodium  salt,  the  platinum 
and  cobalt  5'-imp  complexes  are  given  in  figure  1.  The  geometry  of  the  nucleotides 
around  Pt(II)  and  Co(III)  is  the  same.  An  interesting  feature  of  the  Co(III)  and  Ni(II) 
complexes  is  that  the  ethylenediamine  molecule  has  been  displaced  from  the  metal 
centre,  to  which  the  nucleotides  coordinate.  This  results  in  one  metal  ion  interacting 
only  with  ethylenediamines  and  the  other  with  nucleotides.  The  remaining  octahedral 
coordination  sites  are  occupied  by  waters.  Both  the  metal  ions  lie  on  different  two-fold 


Figure  1.  A  comparison  of  structures  of  (a)  monosodium  salt  of  5'-lMPH,  (b)  [Pt(5'- 
IMP)2  (NH3)2]^'  and  (c)  [Co(5'-IMP)  (en)  (H20)3]'^.  Horizontal  and  vertical  lines  represent 
two-fold  rotation  axes.  Broken  lines  indicate  hydrogen  bonds.  (Figures  1(a)  and  (b)  are 
reprinted  with  permission  from  Chiang  et  al  1978;  Figure  1(c)  with  permission  from  Poojary 
and  Manohar  1982). 
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axes.  Chlorine  ions  which  neutralize  the  charge  in  the  Co(III)  complex  are  disordered  in 
the  vacant  site  between  the  molecules.  The  ethylenediamine  molecules  are  also 
disordered. 

The  5'-gmp  complexes  of  Co(III)  and  Ni(II),  like  the  5'-imp  complexes,  are  based  on 
the  sodium  salt  of  5'-gmp  (Katti  et  al  1981).  In  the  sodium  salt  of  5'-gmp,  a  sodium  ion 
links  the  two  nucleotide  moieties.  This  sodium  ion  is  replaced  by  the  metal  ions  Co(III) 
and  Ni(II).  Recently  a  similar  binding  has  been  observed  (Poojary  and  Manohar  1983) 
in  a  ternary  Cu-5'-imp  complex,  containing  imidazole  groups.  This  complex  is 
isomorphous  with  cis  [Pt(5'-iMp)  (NH3)2]^  ~ .  The  imidazole  moieties  are  in  cis  position 
with  a  partial  occupancy  of  0-4,  while  for  the  rest  of  the  time  these  sites  are  replaced  by 
waters.  The  cis  binding  of  nucleotides  to  Cu(II)  ion  in  the  presence  of  a  7c-aromatic 
amine  (imidazole)  is  at  variance  with  the  results  of  crystallographic  studies  on  ternary 
complexes  containing  other  7r-aromatic  amines  (Aoki  1981)  and  contrary  to  the 
conclusions  derived  from  solution  studies  mentioned  earlier. 

3.  Model  compounds  for  pyridoxal  catalyzed  reactions 

3.1  Role  of  metal  ions 

Enzymes  dependent  on  pyridoxal  phosphate  (plp)  (the  cofactor  form  of  vitamin 
catalyze  a  variety  of  metabolic  reactions  of  amino  acids  such  as  decarboxylation, 
transamination,  racemization  and  carbon-carbon  bond  cleavage.  Many  nonenzymatic 
model  reactions  also  proceed  by  similar  mechanisms  in  the  presence  of  pyridoxal  (pl) 
and  a  suitable  metal  ion  (Holm  1973).  A  general  theory  on  the  mechanism  of 
nonenzymatic  reactions  essentially  involves  the  following  steps:  (i)  Formation  of  a 
Schiffbase  with  subsequent  labilization  of  bonds  attached  to  the  a-carbon  atom  due  to 
electron  displacement  from  the  a-carbon  atom  to  the  electronegative  ring  nitrogen 
through  a  conjugated  system  of  double  bonds,  (ii)  Release  of  H  ,  COOH  ^  or  R  ^  from 
the  a-carbon  atom  depending  upon  their  orientation  with  respect  to  the  extended 
TT-system  (aldimine  plane)  to  produce  a  transitional  Schiffbase,  (iii)  Localization  of  the 
lone  pair  of  electrons  of  the  heterocyclic  nitrogen  and  subsequent  protonation  either  at 
the  a-carbon  atom  for  racemization  and  decarboxylation  or  at  the  formyl  carbon  for 
transamination,  (iv)  Hydrolysis  of  the  carbon-nitrogen  bond  to  give  the  products. 

The  mechanism  for  the  first  stage  of  transamination  reaction  is  given  in  chart  2.  The 
imines  2  and  4  have  been  detected  spectrophotometrically,  but  direct  evidence 
supporting  the  existence  of  intermediate  3  is  lacking.  It  has  been  suggested  that  the  role 
of  the  metal  ion  is  to  stabilize  the  imines  2  and  4  through  chelation  in  a  tridentate 
fashion,  which  provides  a  planar  conjugated  system  via  3  and  increases  the  inductive 
effect  away  from  the  a-carbon  atom  (Metzler  and  Snell  1952).  Thus,  it  is  now  believed 
that  the  metal  ion  serves  as  a  trap  for  the  preformed  Schiff  base. 

It  is  of  interest  to  note  that  different  metal  ions  have  varied  catalytic  influence.  The 
efficiency  towards  pl  transamination  was  found  to  be  in  the  order  Co^^^  <  Ni^"^ 
<  Zn^^  <  Fe^^  <  Fe^^  <  Al  <  Cu^"'',  from  the  investigation  of  the  influence  of 
metal  ions  on  the  rate  of  the  transamination  of  pyridoxamine  with  a-ketoglutarate 
(Longenecker  and  Snell  1957).  As  it  is  conjectured  that  the  metal  ion  has  no  kinetic  role 
in  the  Schiff  base  formation,  apparently  the  cause  for  varied  catalytic  influence  of 
different  metal  ions  lies  in  the  stereochemistry  of  metal-Schiff  base  complex.  From 
pH-metric  titrations  (Christensen  1957),  the  expected  downward  shift  in  the  pK  value 
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of  the  pyridine  nitrogen  in  Ni(II),  Mn(II)  and  Zn(II)  chelates  was  not  observed  and  this 
result  was  attributed  mainly  to  an  unstable  metal-phenolic  oxygen  bond  in  solution. 
But  in  the  solid  state,  the  crystal  structures  of  [Mn(PL-DL-valine)2]  (Willstadter  et  al 
1963),  [Ni(PL-DL-valine)2]  (Capasso  et  al  1974)  and  [Zn(PL-L-valine)2]  (Capasso  et 
al  1974)  have  shown  that  metal-phenolic  oxygen  bond  is  equally  as  strong  as  other  metal- 
donor  bonds.  Capasso  et  al  (1974)  speculated  that  the  puckering  of  the  chelated  Schiff 
bases  could  play  a  decisive  role  in  catalytic  activity.  In  [Ni(PL-glycine)2]  and  [Cu(PLP- 
glycine)  (H2O)]  complexes  also  contrasting  features  have  been  observed  with  respect  to 
the  stereochemistry  {vide  infra),  but  not  in  relation  to  the  strength  of  the  metal-donor 
bonds  (Rao  and  Manohar  1983). 

3.2  Stereochemistry  of  metal  pyridoxal-amino  acid  complexes 

Because  of  the  tridentate  nature  of  the  Schiff  base  ligand,  complexes  with  2 : 1  and  1 : 1 
ligand  metal  stoichiometry  can  be  formed.  The  heterocyclic  nitrogen  may  be 
protonated  according  to  the  charge  neutralization  requirement.  In  1 : 1  complexes, 
confined  to  copper(II)  only,  approximate  square-pyramidal  coordination  geometry 
exists,  in  which  three  positions  in  the  basal  plane  are  occupied  by  the  aldimine  ligand.  In 
[Cu(PL-DL-valine)]  (Dawes  et  al  1982)  the  fourth  basal  position  and  the  fifth  apical 
position  are  occupied  by  a  pyridine  nitrogen  and  a  hydroxymethyl  oxygen  respectively 
of  different  neighbouring  molecules,  whereas  in  [Cu(PLP-DL-phenylalanine)  (H2O)] 
(Bentley  et  al  1968)  they  are  filled  by  a  water  molecule  and  phosphate  oxygen  of  another 
molecule.  In  [Cu(PL-o-phospho-DL-threonine)]  (Aoki  and  Yamazaki  1980)  the 
phosphate  oxygen  from  a  neighbouring  molecule  acts  as  the  fourth  donor  which  results 
in  a  dimeric  structure,  the  fifth  being  a  water  molecule.  The  copper  complex  with  PL-L- 
histidine  has  recently  been  reported  (Dawes  and  Waters  1982)  to  be  trimeric  with  the 
histidine  nitrogen  of  another  ligand  of  the  same  trimer  and  a  water  molecule  being  in 
the  fourth  and  fifth  coordination  positions  respectively. 
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Figure  2.  Structure  of  [Cu(PLP-giycine)  (H2O)]  3H2O  with  atom  numbering  scheme. 


In  [Cu(PLP -glycine)  (H2O)]  (Rao  and  Manohar  1983)  a  polymeric  structure  is 
observed,  the  axial  site  being  occupied  by  a  phosphate  oxygen  of  a  neighbouring 
molecule  translated  along  a  axis;  a  water  molecule  binds  at  the  remaining  equatorial  site 
(figure  2).  There  are  three  more  water  molecules  in  the  asymmetric  unit  which  link  the 
two  polymeric  chains  through  hydrogen  bonds. 

In  2 : 1  complexes,  chelation  takes  place  on  an  octahedral  edge  so  that  the  two  imine 
nitrogens  are  trans  and  the  two  phenolate  and  carboxylate  oxygens  are  mutually  cis  to 
each  other.  From  a  structural  point  of  view,  such  complexes  are  interesting  in  that  the 
bis  (tridentate)  complexes  have  the  abf  structure  for  which  two  enantiomorphous  forms 
A  and  B  are  possible  (chart  3).  The  presence  of  an  asymmetric  centre  in  the  condensed 
amino  acid  generates  the  diastereoisomeric  species,  A{LL)  =  B{DD),  A{DD)  =  B{LL) 
and  A  {DL)  =  B  {LD)  where  A  and  B  stand  for  the  two  possible  enantiomorphous  forms 
of  the  complex,  D  and  L  refer  to  the  configuration  of  the  two  Schiff  base  ligands. 

However,  it  has  been  observed  that  either  the  enantiomorphous  pair  A{LL),  B{DD) 
(or  A{DD),  B{LL))  are  present  in  the  solid  state  as  found  in  [Mn(PL-DL-valine)2] 
(Willstadter  et  al  1963)  and  [Ni(PL-DL-valine)2]  (Capasso  et  al  1974)  or  only  the 
A(LL)  diastereoisomer  as  observed  in  [Zn(PL-L-valine)2]  (Capasso  et  al  1974).  The 
fact  that  complexes  such  as  A{DL)  or  B{DL)  are  not  formed,  as  expected,  with  racemic 
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Schiff  bases  with  the  same  yield  as  A{LL)  and  B{DD)  indicate  that  the  formation  of 
crystalline  complexes  is  stereospecific.  In  [Ni(PL-glycine)2]  complex  (Rao  and 
Manohar  1983),  only  enantiomers  A  and  B  are  present  but  not  those  arising  from  the 
chirality  of  the  amino  acid,  as  glycine  is  nonchiral. 

[Ni(PL-o-phospho-DL-threonine)  (1120)3]  (Aoki  and  Yamazaki  1980)  is  probably 
the  only  noncopper  1 ;  1  transition  metal  pyridoxal-amino  acid  complex  reported  in  the 
literature.  For  this  complex,  three  octahedral  coordination  sites  are  occupied  by  the 
tridentate  ligand  and  the  remaining  three  by  water  molecules.  The  phosphate  unit  of  the 
amino  acid  forms  an  intramolecular  hydrogen  bond  with  the  axial  water  molecule. 

Long  et  al  (1980)  reported  a  binuclear  iron(III)  complex,  [Fe(N-(PL)2  alanine) 
C104]2,  with  an  unusual  pl  derivative.  It  is  proposed  that  the  ligand  has  resulted  from 
the  formation  of  Schiff  base  between  PL  and  pyridoxamine  and  the  subsequent 
nucleophilic  attack  of  the  Schiff-base  imino  group  upon  the  a-keto  carbon  atom  of 
pyruvic  acid.  In  the  dimer,  two  Fe(III)  ions  are  bridged  by  two  a-keto  oxygens  of 
pyruvic  acid  groups.  Both  Fe(III)  ions  have  a  highly  distorted  octahedral  geometry  with 
the  other  coordination  sites  of  each  ion  being  occupied  by  two  phenolic  oxygens,  one 
carboxylate  oxygen  and  one  imine  nitrogen. 

3.3  Reactivity 

It  may  be  expected  that  the  stereochemistry  and  ring  strain  of  the  Schiff  base  ligand 
could  explain  the  rate  and  specificity  of  reactions,  in  particular,  the  lower  and  higher 
activities  of  the  Ni(II)  and  Cu(II)  chelates  respectively.  Mechanisms  of  pyridoxal-amino 
acid  reactions  postulated  involve  electron  transfer  via  the  extended  71-orbital  system. 
This  requires  the  Schiff  base  ligand  to  be  planar.  However,  it  has  been  observed  that  in 
metal  chelates  the  ligand  itself  is  not  planar  and  the  distortion  from  planarity  is  mostly 
due  to  a  rotation  of  carboxy  group  around  the  C(9)-C(10)  bond  (figure  2).  This 
nonplanarity  is  more  pronounced  in  Ni(II),  Zn(II)  and  Mn(II)  complexes  than  in  Cu(II) 
chelates.  For  example,  the  dihedral  angles  between  rings  I  and  II  and  II  and  III 
(figure  3)  are  4°  and  0-7°  respectively  in  [Cu(PLP-glycine)  (H2O)],  whereas  they  are  as 
high  as  18-6°  and  35.9°  in  [Ni(PL-glycine)2]  (Rao  and  Manohar  1983).  These  differences 
in  ligand  distortions  may  be  attributed  to  the  different  metal-donor  distances  for 
different  metals.  Copper  ion,  having  a  mean  metal-donor  distance  less  than  2  A 


Figure  3.  Metal-Schiff  base  complex  showing  the  chelate  rings  and  bonds  around  a-carbon 
atom. 
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occupies  the  center  of  a  nearly  planar  tridentate  ligand.  The  increased  coordination 
bond  lengths  in  the  Mn(II),  Ni(II)  and  Zn(II)  complexes  (200~2T5A)  result  in 
increased  ring  strain  which  determines  the  puckering  of  the  chelated  Schilf  bases.  Thus, 
the  Schiff  base  ligated  to  the  Cu(II)  ion  has  the  least  distortion  which  is  essential  for 
facile  electron  transfer  and  this  explains  the  higher  activity  of  copper  chelates. 

The  various  reactions  involving  a-amino  acids  in  model  systems  involve  the 
formation  of  1 ;  1  and  2 : 1  aldimine  complexes  followed  by  the  breaking  of  a,  b  or  c 
bond  in  the  condensed  amino  acid  portion  (figure  3).  Dunathan’s  hypothesis  (1966) 
states  that  the  bond  perpendicular  to  the  extended  Tt-system  (which  optimizes  orbital 
interactions)  is  labilized  most.  Accordingly  for  [Ni  (PL-o-phospho-DL-threonine) 
(H20)3]  and  [Cu(PL-o-phospho-DL-threonine)],  Aoki  and  Yamazaki  (1980)  consid¬ 
ered  the  orientation  of  a-C-hydrogen  atom  and  compared  this  to  the  relative  activity. 
The  a-C-hydrogen  atom  is  positioned  almost  in  the  PL-aldimine  plane,  the  torsion 
angles  C(7)-N(2)-C(8)-H  being  24°  for  the  Ni  complex  and  33-5°  for  the  Cu  complex. 
This  observation  led  them  to  rationalize  that  the  Cu(II)  ion  under  acidic  conditions  and 
the  Ni(II)  ion  lack  catalytic  activity  (for  the  ^^-elimination  reaction). 
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Macromolecular  crystallography  research  at  Trombay 


K  K  KANNAN  and  R  CHIDAMBARAM 

Neutron  Physics  Division,  Bhabha  Atomic  Research  Centre,  Bombay  400085,  India 

Abstract.  Neutron  diffraction  studies  of  hydrogen  positions  in  small  molecules  of  biological 
interest  at  Trombay  have  provided  valuable  information  that  has  been  used  in  protein  and 
enzyme  structure  model-building  and  in  developing  hydrogen  bond  potential  functions.  The 
new  R-5  reactor  is  expected  to  provide  higher  neutron  fluxes  and  also  make  possible  small- 
angle  neutron  scattering  studies  of  large  biomolecules  and  bio-aggregates.  In  the  last  few  years 
infrastructure  facilities  have  also  been  established  for  macromolecular  x-ray  crystallography 
research.  Meanwhile,  the  refinement  of  carbonic  hydrases’  and  lysozyme  structures  have  been 
carried  out  and  interesting  results  obtained  on  protein  dynamics  and  structure-function 
relationships.  Some  interesting  presynaptic  toxin  phospholipases  have  also  been  taken  up  for 
study. 

Keywords.  Protein  crystallography;  enzyme  mechanism;  neutron  diffraction;  x-ray  diffraction; 
carbonic  anhydrase;  lysozyme;  presynaptic  toxins. 


1.  Introduction 

The  function  of  biological  macromolecules  is  often  dependent  on  the  ability  of  these 
molecules  to  transfer  protons  efficiently,  which  invariably  involves  active  site  hydrogen 
bonding.  The  advent  of  neutron  diffraction  has  enabled  the  accurate  determination  of 
hydrogen  atom  positions  in  small  molecules  of  biological  interest  like  amino  acids  and 
peptides  and  the  stereochemistry  of  the  hydrogen  bonds  in  these  structures 
(Ramanadham  and  Chidambaram  1978).  These  studies  have  provided  valuable 
information  for  use  in  protein  and  enzyme  structure  model-building  and  have 
also  enabled  the  development  of  potential  functions  for  hydrogen  bonds  between 
peptide  groups  (Chidambaram  et  al  1970)  to  be  used  in  protein  conformation  studies. 
Neutron  diffraction  of  biological  molecules  have  been  studied  at  Trombay  using  the 
neutron  beams  from  the  emus  reactor  and  a  computer-controlled  neutron  dif¬ 
fractometer  (Momin  et  al  1978).  The  latter  and  an  x-ray  diffractometer  based  on  the 
same  design  are  incidentally  the  first  (and  so  far  the  only)  computer-controlled 
diffractometers  built  in  the  country.  The  commissioning  in  1984  of  the  lOO-MW 
research  reactor  R-5  which  will  have  several  times  the  neutron  flux  of  the  emus 
reactor  will  facilitate  study  of  biological  molecules  using  neutron  beams.  This  reactor 
with  a  built-in  cold  source  will  also  have  a  facility  for  small-angle  neutron  scattering 
(K  R  Rao,  Private  Communication)  that  can  be  used  for  studying  large  biomolecules 
and  bio-aggregates  using  the  powerful  contrast-matching  method  with  deuterium 
substitution. 

X-ray  diffraction  techniques  are  most  suited  for  studying  globular  protein  and 
enzyme  structures  though  some  neutron  diffraction  studies  have  also  been  attempted  at 
Brookhaven  and  Grenoble  (Chidambaram  1981).  The  study  of  conformation  of 
proteins  was  iriitiated  by  Prof.  G  N  Ramachandran  in  India  and  he  and  his  colleagues 
have  made  outstanding  contributions  which  are  too  well-known  to  be  recounted  here. 
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The  need  for  experimental  studies  in  India  of  macromolecular  structures  especially 
protein  structures,  by  x-ray  diffraction  methods  has  been  felt  for  sometime.  In  1978, 
when  there  was  little  effort  being  made  in  India  in  this  direction,  we  decided  to  initiate  a 
programme  of  study  of  enzyme  structure  and  function  by  x-ray  diffraction  methods. 
From  the  instrumentation  and  computational  knowhow  generated  from  our  earlier 
crystallography  work  and  the  wide  interest  in  related  fields  in  other  divisions  in  barc 
working  on  the  biochemistry  aspects  of  proteins  and  other  macromolecules,  our  group 
seemed  particularly  well-suited  to  undertake  this  research. 

1.1  Why  protein  structure? 

Proteins  are  one  of  the  most  important  components  of  any  living  system  and  carry  out 
all  the  vital  catalytic  functions  of  the  organisms.  Protein  enzymes  exhibit  a  remarkable 
specificity  towards  their  substrate.  This  specificity  of  the  enzyme  is  mostly  due  to  the 
spatial  geometry  of  the  active  site  of  the  enzyme,  which  is  determined  by  the  three- 
dimensional  structure  of  the  enzyme  itself  Thus  an  understanding  of  the  molecular 
aspect  of  enzyme  catalysis  necessarily  requires  the  understanding  of  the  three- 
dimensional  structure  of  the  enzyme.  The  elucidation  of  the  enzyme  mechanism  is  a 
difficult  process  and  requires  the  study  of  the  three-dimensional  structure  of  the 
enzyme  at  different  pH  values  and  also  of  the  enzyme-inhibitor  and  the  enzyme- 
substrate  complex  (Blake  et  al  1967;  Lipscomb  et  al  1968;  Blow  et  al  1969;  Kannan  et  al 
1977b).  The  determination  of  the  active  site  geometry  requires  the  availability  of 
accurate  coordinates  of  the  atoms  that  make  up  the  protein  structure.  The  structure  of 
the  solvent  molecules  that  are  important  components  of  the  live  enzyme  has  also  to  be 
determined  with  a  good  degree  of  precision.  A  knowledge  of  the  mode  of  inhibitor 
binding  is  also  essential  for  an  understanding  of  drug  interaction  with  proteins 
(Kannan  et  al  1977c). 

2.  Protein  structure  determination 

The  all-important  tertiary  structure  of  the  protein  can  be  determined  only  by  single 
crystal  x-ray  diffraction  analysis.  The  method  of  isomorphous  replacement,  i.e.  use  of 
different  heavy  atom  derivatives  of  the  protein,  has  proved  to  be  the  single  most 
important  method  to  elucidate  the  structure  of  proteins  (Green  et  al  1954;  Blundell  and 
Johnson  1976).  Where  complete  or  partial  structure  of  a  related  protein  is  available,  the 
application  of  the  molecular  replacement  methods,  i.e.  the  use  of  rotation  and 
translation  function  (Rossmann  1972),  may  be  fruitful  in  arriving  at  the  structure. 

The  application  of  either  the  method  of  isomorphous  replacement  or  the  molecular 
replacement  method  results  in  an  electron  density  map  calculated  from 

p(x,  y,  z)  =  Z Z  l^o(^^0 1  exp  i[2n{hx  +  ky  +  Iz)  -  ot{hk[)],  (1) 

where  \F{hkl}  \  is  the  structure  amplitude  of  the  reflexion  {hkl)  and  (x{hk[)  is  the  phase 
angle  calculated  by  the  methods  mentioned  above.  The  electron  density  maps  thus 
obtained  are  then  interpreted  either  in  an  optical  comparator  (Richards  1968)  or  in  a 
graphics  display  (Jones  1978)  and  the  structure  of  the  protein  built  up.  This  is  of  course 
a  very  critical  operation  and  the  success  of  the  interpretation  depends  on  the  quality  of 
the  electron  density  maps  and  the  availability  of  atleast  partial  amino  acid  sequence  of 
the  protein  (Kannan  1977).  The  quality  of  the  electron  density  map  is  dependent  on  the 
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quality  of  the  diffraction  data  collected,  i.e.  the  accuracy  of  the  measurement  of  the 
structure  amplitude  for  the  native  protein  and  the  heavy  atom  derivatives.  The  quality 
of  the  map  is  dependent  also  on  the  quality  of  the  phases  a(hkl).  These  are  entirely 
dependent  on  the  quality  of  and  the  number  of  heavy  atom  derivatives  used  in  the 
calculation  in  the  case  of  the  isomorphous  method  and  on  the  degree  of  structural 
homology  between  the  known  and  unknown  structure  in  the  case  of  the  molecular 
replacement  method. 

It  is  important  to  stress  that  the  quality  of  the  crystals  used  and  their  stability  to  x-ray 
radiation  are  primary  requisites  to  a  successful  solution  of  the  protein  structure.  Thus 
crystallization  of  the  native  protein  and  the  heavy  atom  derivatives  assumes  added 
significance  as  these  are  by  no  means  well-established  techniques  like  the  rest  of  the 
protein  structure  determination. 

The  structure  once  determined  as  outlined  above  forms  the  basis  for  a  m^ore 
thorough  analysis.  The  coordinates  obtained  are  subjected  to  crystallographic 
refinement  by  a  combination  of  least  squares  methods,  and  most  important  of  all, 
inspection  and  interpretation  of  difference  electron  density  maps  where  the  amplitude 
\F{hkl)\  is  now  either  -  l^cai(^^Ol)  or  {\2F^^,^{hkl)\-\F^i{hkl)\).  During 

refinement  the  solvent  molecules  are  also  interpreted  and  their  coordinates  in¬ 
corporated  to  get  a  complete  tertiary  structure.  It  must  be  stressed  here  that  without 
locating  the  solvent  molecules  which  are- part  of  the  protein  structure,  the  determi¬ 
nation  of  the  tertiary  structure  of  a  protein  is  incomplete.  Another  important  purpose 
of  the  refinement  is  to  identify  and  include  those  amino  acid  residues  which  were  not 
interpretable  or  wrongly  identified  during  the  course  of  the  interpretation  of  the 
isomorphous  or  molecular  replacement  electron  density  maps. 

A  study  of  a  protein  structure  is  in  itself  very  rewarding.  However,  the  use  of  the 
structure  results  towards  understanding  functions  of  the  protein  is  still  more  important 
and  should  be  a  natural  goal  for  the  protein  crystallographer.  This  requires  the 
availability  of  the  accurate  native  protein  structure  at  different  pH  values  and  also  the 
accurate  structure  of  the  protein-substrate  and  protein-inhibitor  complexes.  Thus  it  is 
always  necessary  to  obtain  the  structure  of  the  protein  under  these  varying  conditions,  a 
not  so  trivial  task  in  terms  of  data  collection  and  computation  and  electron  density  map 
inspection  and  not  least  the  assimilation  and  interpretation  of  the  biochemical- 
biophysical  and  enzyme  kinetic  data  with  reference  to  the  structure  of  the  protein  in  the 
different  environments.  It  is  thus  apparent  that  a  successful  macromolecular  work 
requires  a  well  organised  infrastructure  facility  in  terms  of  powerful  x-ray  diffraction 
equipment,  a  well-equipped  biochemical  laboratory  facility,  automatic  diffractometry 
or  film  recording  equipment,  automatic  film  measuring  facility  and  not  least  of  all  good 
high  speed  computing  facility  and  associated  computer  programs  for  doing  the  varied 
calculations  needed  for  protein  structure-function-inhibition  analysis. 

In  the  subsequent  sections  we  will  discuss  the  infrastructural  facility  at  Trombay  we 
have  set  up  from  1979  onwards  and  also  the  results  of  the  macromolecular  analysis  we 
have  undertaken  here. 

3.  Infrastructure  at  Trombay 

3.1  Instrumentation  and  laboratory  facilities 

The  protein  crystallography  group  has  a  modest  biochemical  facility  for  the 
rudimentary  but  necessary  protein  purification  work  and  has  also  access  to  the 

U 


210 


K  K  Kannan  and  R  Chidambaram 


instrumentation  at  the  Biochemistry  Division  of  the  Bhabha  Atomic  Research  Centre. 
The  equipment  set  up  by  the  group  consists  of  a  locally  manufactured  cold  cabinet  for 
maintaining  the  temperature  of  samples  during  preparation  at  4°C,  an  automatic 
fraction  collector  designed  and  fabricated  by  the  Central  Workshops  and  the  Reactor 
Control  Division,  barc  (Koppikar  et  al  Personal  communication),  apart  from 
other  essential  instruments  like  pH  meters,  gel  electrophoresis  apparatus  and  a 
binocular  wide  field  microscope  for  crystal  viewing  during  crystallization  and  also 
for  crystal  mounting  for  x-ray  diffraction  work.  Many  of  the  crystallization  ap¬ 
paratuses  (Blundell  and  Johnson  1976)  like  zeppezaur  capillaries,  vapour  diffusion 
chambers  for  normal  mode  and  hanging  drop  mode  have  also  been  made  by  the  group. 
The  group  has  successfully  crystallized  proteins  and  reported  the  first  ever  protein 
crystallization  and  crystal  data  in  India  using  these  and  other  facilities  like  dialysis  and 
solution  interfacing  in  melting  point  capillaries.  Snake  venom  toxin  proteins  Notexin 
and  Notechis  II-5  have  been  crystallized  at  Trombay  and  are  currently  under 
investigation  by  x-ray  crystallographic  methods  (Kannan  et  al  1981). 

The  x-ray  diffraction  equipment  consists  of  a  stabilized  x-ray  generator  designed  and 
fabricated  by  the  scientists  at  barc,  Bombay  and  vec,  Calcutta  and  a  high  brilliance 
rotating  anode  (the  latter  is  also  used  for  energy-dispersive  high-pressure  diffraction 
research).  The  vec-barc  generator  uses  the  Phillips  fine  focus  x-ray  tubes  with  copper 
anode.  For  photographic  data  collection,  there  is  a  precession  camera  and  an  Arndt- 
Wonacott  oscillation  camera.  It  is  also  being  proposed  to  modify  an  indigenous 
microprocessor-controlled  x-ray  diffractometer  design  to  suit  the  crowded  reciprocal 
lattices  of  macromolecular  crystals. 

The  x-ray  diffraction  films  are  processed  on  a  computer-controlled  two-dimensional 
microdensitometer,  which  is  also  suitable  for  processing  radiographs  and  other  images 
on  films.  The  map  interpretation  is  done  on  a  three-dimensional  graphics  display 
system  available  with  the  Computer  Section  at  BARC.The  group  uses  mostly  the  Prime 
450  with  the  Computer  Section  at  barc  and  the  vax  11/780  at  tier  for  their  current 
computational  needs. 

3.2  Computer  programs 

Macromolecular  structure  determination  and  refinement  requires  a  large  number  of 
computer  codes.  Often  the  programs  used  by  the  group  have  been  available  on 
computers  other  than  those  accessible  to  them.  This  has  necessitated  the  conversion  of  a 
large  number  of  computer  codes  for  the  protein  group’s  day-to-day  needs.  A  major 
programming  effort  has  been  that  for  the  online  control  of  the  microdensitometer  by 
the  PDF  11/34  computer  under  the  RSXllM  operating  system.  The  group  has  written 
the  driver  routine  (Pal  et  al  1983)  for  the  control  and  also  converted  Allan  Wonacott’s 
film  processing  programs  written  originally  for  a  Nova  computer  system  to  the  pop 
11/34  system  under  the  RSXl  IM  operating  system  using  the  driver  developed  at  barc. 
These  include  programs  to  index  still  photographs,  processing  of  precesion,  oscillation 
photographs  and  absorption  streak  photographs  (Pal  et  al  1983)  and  programs  for  film 
scaling  and  Lorentz  and  polarization  corrections. 

The  data  obtained  from  the  data  processing  programs  above  are  being  handled  by 
the  Protein  program  package  kindly  made  available  to  us  by  Dr.  Steigeman,  Max 
Planck  Institute  for  Biochemistry,  Munich,  Germany  for  the  vax  11/780  computer. 
This  also  includes,  among  other  things,  the  structure  factor  and  electron  density 
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calculation  programs.  This  program  also  forms  the  essential  interface  to  the  map 
manipulation  programs  so  essential  to  use  on  the  graphics  system.  The  maps  thus 
prepared  are  transmitted  to  the  pop  11/34  through  a  computer  link  from  the  Prime  450 
computer,  the  software  for  which  has  been  written  at  barc  (R  S  Mundada  and  R  K 
Talwar,  Private  Communication)  for  interpretation  on  the  graphics  with  the  frodo 
programme  package  (Jones  1978). 

The  refinement  of  the  heavy  atom  derivatives  and  the  calculation  of  the  isomorphous 
phases  are  possible  through  a  number  of  programs  available  to  the  group  (Jarup  et  al 
1970)  and  also  on  the  vax  11/780  computer  with  the  Munich  program  system. 

The  protein  crystallography  group  has  made  effective  use  of  the  restrained  least 
squares  (Hendrickson  and  Konnert  1980)  and  the  constrained-restrained  least  squares 
(coRELs)  (Sussman  et  al  1977)  methods  to  refine  the  protein  structures  described  below 
(Kannan  and  Ramanadham  1981;  Ramanadham  et  al  1980, 1981;  Ramanadham  1982). 
The  program  systems  used  for  this  purpose  have  been  implemented  on  several 
computers.  All  the  above  programs  have  been  used  extensively  to  refine  the  human 
carbonic  anhydrase  B  and  triclinic  hen  egg  white  lysozyme  structures.  Interpretation  of 
the  electron  density  maps  also  requires  a  complete  set  of  contour  maps  drawn  on 
transparent  plastic  sheets,  as  a  number  of  sections  of  the  unit  cell  are  stacked  together  in 
a  suitable  way.  This  is  done  by  the  program  pluto  which  also  gives  the  molecular 
skeleton  plotted  and  superposed  on  the  contour  maps.  This  program  is  operational  on 
Prime  450.  A  fast  fourier  transform  (fft)  program  to  calculate  the  electron  density 
sections  which  is  input  to  pluto  is  also  operational  on  Prime  450. 

For  the  use  of  the  molecular  replacement  method  the  fast  rotation  function  program 
of  Crowther  (Rossmann  1972),  the  slower  rotation  function  program  of  Rossmann 
(1972)  as  also  the  translation  function  program  are  available.  Stereo-drawings  of  the 
molecular  model  or  selected  parts  of  the  molecule  are  obtainable  by  the  use  of  the 
program  ortep2. 

4.  Structure  reflnement  and  function  of  proteins 

We  give  below  a  summary  of  the  protein  crystallographic  studies  carried  out  at 
Trombay  at  the  same  time  as  the  necessary  infrastructure  facilities  for  experimental 
macromolecular  crystallography  research  were  being  built  up. 

4.1  Presynaptic  toxins 

A  programme  of  structure-function  studies  on  presynaptic  toxin  phospholipases  has 
been  initiated  at  Trombay.  The  presynaptic  toxins  are  widely  distributed  in  nature  and 
are  present  in  good  quantities  in  the  venom  of  some  snakes.  They  are  broadly  classified 
into  monomeric  toxins  and  multimeric  toxins.  They  also  possess  phospholipase  A2 
activity  and  the  amino  acid  sequence  shows  very  large  homology  with  snake  venom, 
bovine  and  porcine  phospholipases.  The  Australian  tiger  snake  (Notechis  Scutalus 
Scutalus)  is  a  rich  source  of  the  presynaptic  toxins  of  the  monomeric  type.  There  are  a 
number  of  fractions  (Halpert  and  Baker  1976)  which  show  toxicity  and  phospholipase 
activity  and  also  some  which  have  only  sequence  homology  but  no  toxicity  or 
phospholipase  activity.  Three  of  these  proteins  are  under  investigation  at  Trombay. 
They  are  Notechis  II-4  or  Notexin,  Notechis  II- 1  and  Notechis  II-5.  Of  these  Notechis 
II-l  shows  no  toxicity  and  Notexin  is  more  toxic  than  Notechis  II-5.  These  toxins  are 
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basic  proteins  with  119  amino  acid  residues  cross-linked  by  several  disulphide  bridges. 
Notechis  II-5  was  the  first  protein  to  be  crystallised  and  characterised  in  India  and  was 
reported  by  Kannan  et  al  (1981).  The  homology  with  porcine  and  bovine  phos¬ 
pholipases  will  be  used  and  the  structure  solution  attempted  by  molecular  replacement 
methods.  It  may  be  possible  to  arrive  at  the  toxicity  centre  of  these  toxins  by  comparing 
their  three-dimensional  structures  which  will  help  in  finding  the  mechanism  of 
presynaptic  toxin  action.  The  unit  cell  dimensions  of  these  two  toxins  (Kannan  et  al 
1977a)  are 

Notexin  :  a  —  b  =  75  03  A,  c  =  49  04  A,  a.  =  (i  =  90°  and  y  =  120°.  6  mol¬ 

ecules  in  a  trigonal  unit  cell  with  space  group  P3i21  or  P3221 
Notechin  II-5  :  a  —  146  A,  b  =  43-5  A  and  c  =  39  0  A,  a  =  )?  =  y  =  90°.  8  mol¬ 
ecules  in  an  orthorhombic  unit  cell  with  space  group  P2i2i2i. 

4.2  Lysozyme 

Lysozyme  was  the  first  enzyme  and  the  second  protein  whose  structure  was  resolved  at 
high  resolution  by  x-ray  diffraction  technique  (Blake  et  al  1965).  The  enzyme  lyses 
polysaccharide-like  polymers  of  N  acetyl  glucosamine  and  N-acetyl  muramic  acid. 

Lysozyme  is  a  monomer  of  129  amino  acids  of  molecular  weight  14  600.  There  are 
four  disulphide  bridges.  Hen  egg  white  lysozyme  has  been  crystallised  in  a  number  of 
crystallographic  forms.  Blake  et  al  (1965)  investigated  the  structure  of  the  tetragonal 
form  of  the  enzyme  crystals.  Jensen  and  colleagues  (Ramanadham  et  al  1980)  have 
studied  the  triclinic  modification  for  which  x-ray  diffraction  data  extends  to  about  1  A 
resolution.  The  initial  structure  in  the  triclinic  unit  cell  was  derived  by  the  application  of 
molecular  replacement  methods  and  subsequently  the  structure  was  refined  by 
difference  fourier  methods  (Ramanadham  et  al  1980).  The  structure  was  then  refined  by 
Ramanadham  ct  a/  (1981)  by  restrained  least  squares  method. 

The  restrained  least  squares  refinement  of  triclinic  hen  egg-white  lysozyme  at  2  A 
resolution  was  initiated  and  carried  out  to  an  R-factor  of  0T79  at  the  LFniversity  of 
Washington,  Seattle,  by  Jensen  and  colleagues  (Ramanadham  et  al  1980).  This  work  has 
been  continued  at  Trombay  in  collaboration  with  the  Seattle  group.  The  present 
R-factor  is  0T69. 

The  restrained  least  squares  method  utilizes  the  known  geometry  of  the  different 
protein  groups  and  introduces  various  geometrical  restraints  in  the  least  squares 
procedure  as  a  set  of  additional  observations,  thus  improving  the  overdeterminacy 
ratio. 

The  quantity  minimised  is 

-i-  O2  +  O3  +  O4.  -f-  . . . ,  (2) 

where  <l>i  is  the  conventional  structure  factor  term  in  the  least  squares  procedure. 
^i=Y^{\Fo\-\Fc\)\ 

02,  O3,  etc.  are  terms  for  the  different  restraints  for  planar  groups,  atomic  distances, 
chiral  volumes,  non-bonded  contacts,  isotropic  thermal  parameters,  conformational 
parameters,  symmetry  features  and  so  on  (Hendrickson  and  Konnert  1980). 

In  the  refinement  of  Lysozyme  (Ramanadham  et  al  1981)  4961  parameters  were 
adjusted  by  the  restrained  least  squares  procedure.  There  were  1001  protein  and  239 
solvent  atoms  and  one  scale  factor  as  against  7131  independent  Fobs  structure 
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amplitude  values  within  2  A  resolution  and  about  3000  restraints.  The  root  mean 
square  deviation  in  the  distances  between  different  atoms  from  ideal  values  were  less 
than  0-05  A  for  the  final  model. 

Protein  crystals  suffer  from  static  and  dynamic  disorder  and  it  would  be  difficult  to 
assign  a  meaning  to  the  temperature  factor  in  a  conventional  sense.  However  it  has  been 
noticed  that  the  temperature  factor  of  the  protein  atoms  in  triclinic  lysozyme  was  very 
low  compared  to  many  other  protein  structures.  The  average  B  value  for  the  molecule  is 
about  8  0  A},  much  smaller  than  in  tetragonal  lysozyme  and  this  seems  to  be  related  to  a 
larger  fraction  of  the  water  molecules  being  ordered  in  this  structure  (Ramanadham  et 
al  1981). 

Mn^^,  Co^"^  and  Gd^"^  have  been  found  to  bind  to  the  active  site  of  Lysozyme 
(Kurachi  et  al  1975  and  Teichberg  et  al  1974).  It  has  also  been  observed  that  divalent 
copper  ions  inhibit  lysozyme.  The  binding  of  Cu^  to  tetragonal  lysozyme  has  been 
reported  by  Teichberg  et  al  (1974),  Ramanadham  et  al  (1981)  have  studied  the  binding 
of  Cu^'*^  to  triclinic  lysozyme  at  Trombay  using  the  2  A  data  collected  at  University  of 
Washington.  The  structure  of  the  Cu^^  lysozyme  complex  was  refined  by  restrained 
least  squares  method  using  the  refined  coordinates  of  the  triclinic  lysozyme  structure  to 
a  crystallographic  R-factor  of  16-9  %  after  two  cycles  of  refinement.  Ramanadham  et  al 
(personal  communication)  have  not  observed  any  significant  deviations  of  the  protein 
atoms  on  copper  binding.  They  had  observed  that  one  Cu^'^  ion  was  found  in  the 
vicinity  of  Glu  35  in  the  active  site  and  a  second  copper  ion  was  bound  to  the  enzyme  in 
the  vicinity  of  His  15  residue. 

4.3  Human  carbonic  anhydrase 

Mammalian  erythrocytes  are  a  rich  source  of  carbonic  anhydrase  enzyme  which 
reversibly  catalyses  the  interconversion  of  bicarbonate  to  carbon  dioxide  and  water. 
The  human  erythrocytes  contain,  apart  from  the  major  component  haemoglobin,  two 
carbonic  anhydrase  isoenzymes,  form  B  (hcab  )  and  form  C  (hcac  ).  The  two  isoenzymes 
are  homologous  to  each  other  with  about  260  amino  acid  residues  and  an  essential 
Zn^^  ion  and  have  a  molecular  weight  of  30000.  There  are  no  disulphide  bridges  in 
these  enzymes.  The  isoenzymes  differ  in  their  catalytic  efficiency.  Aromatic  sul- 
phonamides  which  are  specific  inhibitors  of  these  enzymes  also  exhibit  differing 
inhibitory  properties.  The  enzymes,  however,  are  highly  efficient  in  the  catalysis  of  CO2 
bicarbonate  interconversion  with  a  turnover  rate  of  10^  moles  of  CO2  per  mole  of 
enzyme  per  sec.  The  mechanism  of  action  of  these  fascinating  enzymes  are  still  hotly 
debated  by  the  different  workers  in  the  field  (Lindskog  et  al  1971). 

The  three-dimensional  structure  of  hcab  (Kannan  et  al  1975)  and  hcac  have  been 
now  known  for  some  time  (figure  1)  (Kannan  et  al  1971  and  Liljas  et  al  1972).  Kannan 
et  al  (1981)  have  been  refining  the  structure  of  hcab  at  2  A  resolution  to  get  the  complete 
tertiary  structure  of  the  enzyme  including  the  solvent  structure  in  the  crystals.  The 
solvent  structure  may  play  an  important  role  in  the  mechanism  of  action  of  the  enzyme. 
The  enzymes  also  show  a  different  folding  pattern  compared  to  many  of  the  other 
protein  structures  known.  The  molecule  is  twice  as  large  as  lysozyme  discussed  above. 
The  diffraction  data  had  been  collected  at  Uppsala,  Sweden  on  photographic  films 
using  precession  cameras  (Kannan  et  al  1975)  during  the  course  of  the  high  resolution 
structure  determination.  The  coordinates  of  hcab  had  been  measured  from  a  Kendrew 
Watson  skeletal  model  and  the  model  adjusted  to  conform  to  stereochemical  standards 


214 


K  K  Kannan  and  R  Chidambaram 


COj4HjO^HCO>H* 


Figure  1.  Schematic  diagram  of  the  three-dimensional  structure  of  carbonic  anhydrase 
molecule  with  the  amino  acid  sequence  of  the  HCAB  and  HCAC  in  the  one  letter  code. 


using  the  model  building  program  (Diamond  1971).  The  crystallographic  JR-factor  was 
about  50%.  The  model  had  been  fitted  to  the  isomorphous  map  in  a  3D  graphics 
display  by  one  of  us  at  Max  Planck  Institute  for  Biochemistry,  Munich,  Germany  using 
the  programme  frodo  (Jones  1978)  and  the  resultant  model  had  been  refined  with  the 
real  space  refinement  program  (Diamond  1974)  on  a  Siemens  4004  computer  at 
Munich.  The  P-factor  was  39-5  %  for  the  14578  observed  reflections  in  the  5  to  2  A 
sphere.  The  weak  unobserved  reflections  had  been  dropped  at  this  stage. 

The  structure  was  then  refined  by  Kannan  and  Ramanadham  (1981)  at  Trombay 
using  the  restrained  least  squares  programs,  discussed  earlier  in  the  section  on 
lysozyme,  to  36-5  %  R-value  for  the  3  A  resolution  data  and  later  to  32-3  %  for  the 
14524  reflection  in  the  5  to  2  A  sphere  where  individual  isotropic  temperature  factors 
were  used  in  the  refinement.  However  the  improvements  in  R-factors  after  8  cycles  of 
least  squares  refinement  tapered  off.  The  model  was  now  compared  with  the  electron 
density  maps  and  wrong  orientation  of  the  residues  corrected.  The  three-dimensional 
graphics  facility  at  the  Wallenberg  Laboratory,  Uppsala,  Sweden  was  used  during  a 
visit  by  one  of  us  for  this  purpose  in  collaboration  with  Dr  T  A  Jones  (Kannan  et  al 
1982).  The  model  was  refitted  to  the  isomorphous  electron  density  map  in  the  graphics 
system.  Kannan  et  al  (1982)  have  reported  that  the  model  required  refitting  in  a  number 
of  regions  and  also  a  number  of  side  chains  and  even  main  chain  atoms  dropped  from 
the  refinement  due  to  difficulties  in  fitting  to  the  isomorphous  electron  density  map.  Of 
the  total  of  2200  protein  atoms  they  had  included  only  about  1850  atoms  in  the 
constrained-restrained  least  squares  procedure  corels  with  one  overall  temperature 
factor.  The  R-factor  had  dropped  after  2  cycles  from  45-5  %  for  the  refitted  model  to 
36  %  for  the  6  to  2  A  data. 
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They  then  fitted  the  model  to  a  (|2Fo|  —  |Tc|)  electron  density  map  in  the  graphics 
system.  In  this  process  a  number  of  residues  which  were  left  out  from  the  previous 
calculation  were  located  and  included  in  further  refinement.  In  all  they  had  done 
alternatively  4-model  fitting  sessions  to  the  electron  density  maps  on  the  interactive 
graphics  system  and  8  cycles  of  corels.  At  this  stage  only  4  residues  at  the  amino 
terminus,  some  side  chains  especially  lysines  and  all  the  solvent  atoms  were  not 
included.  The  K-factor  had  dropped  to  31  %  with  one  overall  temperature  factor  and  to 
26-7  %  while  refining  two  group  temperature  factors  for  every  residue  in  the  last  two 
cycle  (Kannan  et  al  1982). 

The  refinement  has  since  been  continued  at  Trombay.  About  60  solvent  molecules 
located  within  hydrogen  bonding  distance  to  relevant  proteins  groups  from  (|Fo| 
—  I  F<,  I )  electron  density  map  were  included  and  individual  restrained  isotropic 
temperature  factors  refined  for  all  atoms  using  the  restrained  least  squares  procedure 
outlined  in  the  section  on  lysozyme.  After  2  cycles  the  F-factor  had  dropped  to  22-7  %. 
The  refinement  using  corels  also  gave  an  almost  identical  result  giving  a  i?-factor  of 
22-5%.  An  inspection  of  the  (|Foi  ~  |^c|)  electron  density  map  resulted  in  40  more 
solvent  atoms  being  located,  within  hydrogen  bonding  distance  to  the  relevant  protein 
group  or  solvent  atoms  located  in  the  earlier  ( |  Fq  |  —  |  F<.  | )  map.  The  interactive  three- 
dimensional  graphics  facility  became  available  at  this  time  at  Trombay  and  the  frodo 
program  (Jones  1978)  system  was  made  operational  on  the  PDF  11/34  computer 
system  connected  to  the  graphics  system  by  one  of  us.  A  (]2Fo|  —  |Fc|)  map  was  then 
used  to  fit  the  refined  coordinates  and  the  solvent  atoms  (Kannan  et  al,  personal 
communication).  They  have  reported  correcting  the  wrongly  oriented  side  chains, 
especially  the  forked  residues  for  a  number  of  side  chains.  They  also  located  another  60 
solvent  atoms  from  an  inspection  of  the  (|2Fo|  —  |Fj)  and  the  ([Fq]  —  |F<,j)  maps.  A 
number  of  side  chain  conformational  angles  were  also  corrected  to  near-staggered 
orientation  in  conformity  with  the  electron  density  on  the  graphics  system.  It  was 
observed  by  them  that  none  of  the  least  squares  refinement  programs  was  capable  of 
correcting  such  errors  and  often  forcing  the  residues  to  false  minima.  Thus  only  manual 
intervention  could  rectify  such  faults.  After  three  more  cycles  of  restrained  least  squares 
refinement  the  F-factor  had  dropped  to  19-5  %  (figure  2)  for  the  reflections  in  the  5  to 
2  A  sphere.  All  the  15524  reflection  in  the  10-2  A  sphere  were  then  used  by  them,  and  a 
( I  2Fq  I  —  I  Ff  I )  map  and  a  ( |  Fq  [  —  |  F^  | )  map  computed  and  used  on  the  graphics  system 
for  further  improvement  in  the  model-electron  density  fittings  and  solvent  localization. 
Kannan  et  al  (personal  communication)  were  able  to  identify  two  more  of  the  amino 
terminal  residues,  all  but  two  lysine  side  chains  and  an  arginine  side  chain  as  also  about 
150  solvent  atoms  either  hydrogen-bonded  to  the  protein  or  to  the  solvents,  located 
earlier.  Thus  about  300  solvent  atoms  have  been  added  to  the  protein  structure  so  far. 

They  also  have  found  some  interesting  solvent  structure  in  the  hcab  molecule.  They 
found  that  the  catalytically  important  Zn^  has  a  solvent  coordinated  to  it  and  is  within 
hydrogen  bonding  distance  to  Thr  199.  There  is  a  solvent  molecule  bridging  Glu  106 
and  Tyr  7  by  hydrogen  bonding  to  them.  Glu  106  was  also  found  hydrogen-bonded  to 
Thr  199.  A  solvent  hydrogen-bonded  by  Tyr  6  was  also  found  hydrogen-bonded  to  His 
64  located  in  the  active  site  of  the  enzyme.  Another  interesting  feature  of  the  solvent 
structure  seems  to  be  their  participation  in  breaking  pleated  sheet  hydrogen  bonding.  A 
few  such  solvents  have  been  located  by  Kannan  et  al  (private  communication). 

Another  important  feature  reported  by  Kannan  et  al  is  regarding  the  temperature 
factors.  Internal  residues  and  residues  in  the  pleated  sheet  structure  have  very  low 


216 


K  K  Kannan  and  R  Chidambaram 


Figure  2.  A  plot  of  the  descrepancy  factor  versus  least  squares  cycle  number  in  the 
refinement  of  HCAB.  Reproduced  with  permission  from  Kannan  et  al. 


temperature  factor,  in  some  instance  2  0  A^,  and  residues  on  the  surface  of  the  molecule, 
especially  lysines,  and  other  polar  residues  have  very  high  temperature  factors, 
sometimes  as  high  as  30  A^.  The  solvents  also  have  varying  temperature  factor  from 
6  A^  to  60  A^  depending  on  their  location  and  hydrogen  bonding.  The  average  B- value 
for  all  the  protein  atoms  is  12  A^. 

The  refined  coordinates  of  hcab  has  been  used  to  calculate  ( 1 2  Fq  |  —  |  f c  | )  maps  for  a 
number  of  inhibitor  complexes  and  one  metal  replacement  for  2  A^  at  Trombay.  The 
results  of  these  investigation  agree  well  with  those  reported  earlier  by  Kannan  et  al 
(1977b  and  1977c)  and  Kannan  (1980a  and  1980b).  The  mechanistic  proposal  made  by 
Kannan  et  al  (1977b)  would  still  be  valid  with  minor  modification  in  the  light  of  the 
solvent  structure  found  during  the  course  of  the  refinement. 

5.  Conclusion 

Neutron  beam  research  at  Trombay  on  small  biomolecules  will  be  enlarged  in  scope 
with  the  advent  of  the  higher-flux  R-5  reactor  to  include  small  angle  neutron  scattering 
studies  of  large  biomolecules  and  bio-aggregates.  We  are  now  also  fully  equipped  at 
Trombay  to  undertake  macromolecular  crystallography  research  with  x-rays.  The 
experimental  facilities  have  been  installed  and  a  complete  library  of  computer  programs 
established.  Meanwhile  a  number  of  protein  structures  have  been  refined  and 
interesting  results  obtained  on  protein  dynamics  and  structure-function  relationships. 
We  have  also  taken  up  the  study  of  several  new  proteins. 
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On  the  polymorphism  of  the  smectic  A  phases  of  highly  polar 
compounds 


N  V  MADHUSUDANA 

Raman  Research  Institute,  Bangalore  560080 

Abstract.  The  smectic  A  phase  is  characterized  by  a  layered  arrangement  of  rod-like 
molecules,  with  the  average  orientation  of  the  molecules  normal  to  the  layers  and  a  liquid-like 
organization  within  the  layers.  When  the  molecules  have  a  strongly  polar  end  group  (like 
-CsN,  -NO2  etc.,),  neighbouring  molecules  prefer  an  antiparallel  orientation.  This  results  in  a 
bilayer  structure,  with  a  layer  spacing  much  larger  than  the  molecular  length.  In  many 
compounds,  the  structure  of  the  bilayer  is  sensitive  to  temperature  (or  pressure).  This  results  in 
some  interesting  phenomena:  (a)  reentrant  phases,  as  for  example,  the  appearance  of  a  nematic 
phase  below  the  temperature  range  of  existence  of  the  A  phase;  (b)  a  number  of  polymorphic 
forms  of  smectic  A;  and  (c)  an  ‘induced  smectic  A’  phase  when  the  compounds  with  the  highly 
polar  terminal  groups  are  mixed  with  weakly  polar  compounds  even  when  neither  compound 
by  itself  exhibits  the  A  phase.  This  article  gives  a  brief  summary  of  recent  results  on  all  these 
aspects. 

Keywords,  smectic  A;  polymorphism;  reentrant  phases;  bilayer  smectics;  induced  smectic 
phases. 


1.  Introduction 

The  smectic  phases  of  liquid  crystals  are  characterized  by  a  layered  arrangement  of  rod¬ 
like  molecules  (for  a  general  introduction  to  liquid  crystals,  see  Chandrasekhar,  1977; 
de  Gennes  1975).  Smectic  A  is  the  simplest  kind  of  smectic  phase  with  a  liquid-like 
organisation  within  the  layers,  and  the  average  orientation  of  the  long  axes  of  the 
molecules  (indicated  by  a  unit  vector  n  called  the  director)  normal  to  the  layers.  A 
drawing  of  the  structure  is  shown  in  figure  1.  However  this  figure  is  purely  schematic.  In 
reality,  the  periodicity  along  the  z-axis  is  very  well  described  as  a  sinusoidal  density 
wave  (McMillan,  1971;  de  Gennes  1972)  given  by 

p  =  Po[l+Re{|tA|e'(‘?oz  +  -^)}]  (1) 


Figure  1 .  An  idealised  schematic  drawing  of  the  arrangement  of  molecules  in  the  smectic  A 
phase. 
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where  =  2nld,  d  being  the  layer  spacing  and  </>  a  phase  factor  which  fixes  the  position 
of  the  layers  and  \il/\  the  amplitude  of  the  density  wave  (figure  2).  Indeed  even  the 
second  order  maximum  is  hardly  seen  in  the  x-ray  diffraction  pattern  from  most  of  the 
compounds  in  the  A  phase  (Schaetzing  and  Litster  1979).  The  first  order  reflection  is  also 
not  a  true  Bragg  spot  but  has  a  power  law  decay  of  the  intensity  as  one  moves  away  from 
q^.  This  is  a  consequence  of  the  well-known  Landau- Peierls  instability  of  an  one¬ 
dimensional  periodic  order  (Landau  and  Lifshitz  1959;  Peierls  1934). 

The  chemical  structure  of  a  typical  mesogenic  compound  with  rod-like  molecules  has 
a  central  aromatic  core  which  is  quite  rigid  and  a  relatively  flexible  alkyl  chain  at  one  or 
both  ends  (figure  3).  Experimentally  it  is  well  known  that  the  lower  homologues  {i.e., 
compounds  with  shorter  alkyl  chains)  exhibit  only  the  nematic  (N)  phase  which  is 
characterised  by  a  purely  orientational  long  range  order.  The  smectic  phase  makes  its 
appearance  when  the  alkyl  chain  is  relatively  long.  The  first  member  exhibiting  the  A 
phase  shows  a  relatively  weak  A-N  transition,  but  as  the  chain  length  is  increased,  the 
strength  of  the  transition  increases  and  beyond  a  certain  chain  length,  the  A  phase  goes 
over  directly  to  the  isotropic  phase.  These  observations  can  be  understood  by  assuming 
that  the  strongest  intermolecular  attractive  interactions  are  confined  to  the  aromatic 
cores  of  the  molecules.  This  would  naturally  lead  to  the  formation  of  a  layered 
arrangement  of  molecules  with  relatively  long  alkyl  chains  at  both  ends.  This  is  the 
physical  idea  behind  the  molecular  theory  of  the  A  phase  proposed  by  McMillan  (1971). 
Assuming  that  the  translational  and  orientational  orders  are  coupled,  he  showed  that 
the  A-N  transition  can  be  of  second  order  character  if  the  orientational  order  at  the 
transition  point  is  sufficiently  high. 

As  we  saw  earlier,  the  order  parameter  characterising  the  A  phase  has  two 
components  viz  the  amplitude  and  the  phase  of  the  density  wave  (equation  (1)).  This  is 
rather  analogous  to  the  order  parameter  of  a  superconductor.  Further,  because  of  the 
layered  arrangement,  the  A  phase  does  not  allow  a  curl  n  distortion  which  requires  a 
change  of  layer  spacing.  This  is  analogous  to  the  expulsion  of  a  magnetic  field  by  a 


Figure  2.  A  realistic  distribution  of  molecules  in  the  A  phase  (from  Schaetzing  and  Litster 
1979). 


Figure  3.  Chemical  structure  of  a  typical  mesogenic  compound. 
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superconductor.  The  close  analogy  between  the  (second-order)  A-N  transition  and 
superconductor-normal  metal  transition  was  recognised  by  de  Gennes  (1972)  who 
wrote  the  relevant  Landau-Ginzburg  free  energy  functional.  Using  this  analogy,  he 
predicted  the  divergence  of  twist  and  bend  elastic  constants  as  (the  A-  N  transition 
point)  is  approached  from  the  nematic  side.  [For  a  review  of  the  experimental  situation 
up  to  1978  see  Chandrasekhar  and  Madhusudana  1978.  For  recent  results  see  Als- 
Nielsen  1981].  He  also  predicted  a  reduction  in  due  to  a  curl  n  type  of  distortion 
(which  was  only  recently  confirmed  experimentally;  Madhusudana  and  Srikanta  1982). 
However  the  analogy  is  not  exact.  For  example,  it  has  been  found  that  <^||  and  the 
correlation  lengths  parallel  and  perpendicular  to  the  director  diverge  with  different 
critical  indices  as  is  approached.  The  theory  of  an  transition  still  continues  to  be  a 
very  active  area  of  investigation  (see  for  example,  Als-Nielsen  1981). 

In  this  article  we  shall  confine  our  attention  to  some  interesting  features  exhibited  by 
the  smectic  A  phase  when  the  compound  has  a  highly  polar  end  group.  Many 
compounds  have  been  synthesised  in  recent  years  which  have  either  a  cyano  (-C=N)  or 
nitro  (-NO2)  end  group,  both  of  which  have  a  longitudinal  dipole  moment  of  ~  4 
Debyes.  Till  about  a  decade  ago  the  longitudinal  dipoles  were  thought  to  play  only  a 
minor  role  in  stabilising  the  nematic  and  smectic  A  phases,  especially  after  it  was 
recognized  that  Born’s  theory  of  the  nematic  phase  (Born  1916)  which  is  based  entirely 
on  interactions  between  permanent  dipoles  is  inadequate  (Maier  and  Saupe  1959).  We 
proposed  that  for  minimising  the  mutual  interaction  energy,  neighbouring  molecules 
with  strong  longitudinal  dipoles  should  favour  an  antiparallel  arrangement 
(Madhusudana  and  Chandrasekhar  1973;  Madhusudana  et  al  1977).  This  was  amply 
confirmed  by  x-ray  and  neutron  scattering  studies  (Leadbetter  et  al  1975, 1979;  Lyden 
and  Coakley  1975)  which  showed  that  in  aUcyl  (or  alkoxy)  cyanobiphenyls  the  smectic  A 
phase  is  characterized  by  a  layer  spacing  which  is  ~  1-4  times  the  molecular  length 
(figure  4).  As  we  pointed  out  earlier,  the  attractive  interaction  between  neighbouring 
molecules  would  favour  aromatic  regions  to  overlap,  and  the  antiparallel  correlation 


Figure  4.  Schematic  diagram  of  antiparallel  local  structure  in  pentyl-cyanobiphenyl 
(Leadbetter  et  al  1975). 
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brought  about  by  the  strong  dipole  moments  would  then  lead  to  a  structure  as  shown  in 
figure  4.  Another  way  of  describing  such  a  structure  would  be  to  say  that  two  such 
molecules  would  form  an  antiparallel  ‘pair’,  which  should  be  a  relatively  stable  unit 
because  of  frustration  effects  which  any  additional  molecule  would  be  subjected  to  in 
the  neighbourhood  of  such  a  pair.  An  interesting  consequence  of  the  formation  of  such 
pairs  is  that  they  are  much  more  symmetric  than  the  individual  molecules  which  are 
highly  asymmetric  because  of  the  presence  of  the  alkyl  chain  only  at  one  end  of  the 
aromatic  core.  The  pairs  have  chains  at  both  ends  and  the  McMillan  argument 
regarding  the  conditions  of  formation  of  the  smectic  A  phase  is  now  applicable  for  the 
pairs  and  thus  many  such  compounds  with  cyano  end  groups  exhibit  the  A  phase  (de 
Jeu  1982).  The  A  phase  thus  formed  is  referred  to  as  a  bilayer  A  phase  for  obvious 
reasons.  Even  if  the  chain  length  is  too  short  to  form  the  A  pha.se,  such  compounds 
exhibit  a  strong  smectic-like  short  range  order  in  the  nematic  phase,  the  short  range 
order  having  bilayer  characteristics.  The  structure  of  the  bilayer  should  depend  on  the 
chemical  structure  of  the  aromatic  core.  Further,  in  any  given  case  the  bilayer  structure 
is  an  extra  degree  of  freedom,  which  can  be  influenced  by  variations  in  temperature, 
pressure  or,  in  the  case  of  mixtures,  the  composition.  This  in  turn  leads  to  several 
interesting  phenomena  which  will  be  discussed  in  the  following  sections. 

2.  Reentrant  phases 

Cladis  (1975)  discovered  that  some  binary  mixtures  of  compounds  with  cyano  end 
groups  exhibit  the  following  sequence  of  transitions  as  the  mixture  is  cooled  from  the 
isotropic  phase: 

isotropic  nematic  smectic  A  -+  nematic. 

The  translational  order  of  the  A  phase  is  lost  both  when  it  is  heated  and  cooled.  The 
nematic  phase  occurring  at  temperatures  lower  than  the  range  of  stability  of  the  smectic 
A  phase  is  called  the  ‘reentrant’  nematic  ((N^)  phase,  in  analogy  with  similar  reentrant 
phases  in  certain  superconductors  doped  with  magnetic  impurities  (Riblet  and  Winzer 
1971)  and  the  solid-liquid  ^He  transition  (Anderson  et  al  1963). 

Subsequently  Cladis  et  al  (1978)  found  that  some  pure  compounds  exhibited  the 
reentrant  nematic  phase  at  elevated  pressures  and  a  little  later  pure  compounds  were 
found  which  exhibited  the  reentrant  phase  even  at  atmospheric  pressure  (Madhusudana 
et  al  1979;  Hardouin  et  al  1979;  Subramanya  Raj  Urs  and  Sadashiva  1982).  During  the 
past  3  years  a  large  number  of  compounds  have  been  synthesized  which  exhibit 
reentrant  phases.  (For  a  recent  review  on  the  chemistry  of  these  compounds,  see  Tinh 
1983).  Table  1  gives  a  representative  list  of  compounds  which  exhibit  reentrant  phases. 
It  is  seen  that  some  compounds  not  only  exhibit  a  reentrant  nematic  phase,  but  also  a 
reentrant  smectic  A  (A^)  phase  which  is  obtained  on  cooling  the  N,^  phase. 

Though  one  might  expect  that  changes  in  the  bilayer  structure  should  be  responsible 
for  the  occurrence  of  the  N,j  phase,  x-ray  investigations  do  not  reveal  any  universal 
trend  in  the  thermal  variation  of  the  bilayer  spacing  close  to  the  AN  ^  transition  point.  In 
some  cases  which  exhibit  only  the  N,,  phase  below  the  A  phase,  the  bilayer  expands  as 
the  sample  is  cooled  to  T^^r  (Chandrasekhar  et  al  1 979).  This  expansion  is  also  reflected 
in  the  low  frequency  dielectric  dispersion  which  arises  due  to  the  flipping  of  the 
molecules  about  their  short  axes,  the  corresponding  activation  energy  changing 
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Table  1. 


STRUCTURAL  FORMULA  AND  NAME 


traiia-p-n  — decyloxy-oi.-  methyl  - 
p'-cyanophenyl  cinnamate 
I  10  OMCPC) 


nC,2 


4  -  cyanophonyl -3-methyt -4  (4  -  n  — 
dodecyl  benzoyloxy  1  benzoate 
I12CPMBBI 


SEQUENCE  OF  TRANSmONS  AS  THE  COMPOUND  IS  COaEO 


I  73.5  N56.8  Ad  32.5  Nr 


1 148  N  138.5  Ad59.8  Nr 


"  C8Hi,0  — CO.O— <^>-C  H=CH— (g>-CN 

4  -n  -  octyloxybenzoyloxy-4  -  cyano- 
stilbene  IT8I 


1  283  N  248  Ad  138  Nr  94.5  Ai 


r’CioHjiO— <g>-CH=CH— CQO^^>-CH=N-d;^>-CN 

4-  (4 -n  -  decyloxy  cinnamoyloxy  )  benzylidene- 
4"— cyanoanillne 


I  267N  257  Ad87  C  70 Nr 


distinctly  (Ratna  et  al  1979).  The  activation  energy  of  the  phase  itself  is  much  larger 
than  in  the  normal  N  phase.  But  in  compounds  exhibiting  the  A  ^  phase  also,  the  trend  is 
quite  different.  The  bilayer  spacing  decreases  rapidly  from  ~  1-25  /  to  IT  /  as  the 
temperature  of  the  A  phase  is  lowered  to  T^^r-  Iti  the  reentrant  smectic  A  phase  there  is 
a  temperature  independent  layer  spacing  which  is  ~  /,  i.e.,  the  A  ^  phase  is  a  monolayer 
smectic  (Hardouin  and  Levelut  1980).  Moreover,  the  A^  phase  gives  rise  to  a  diffuse 
scattering  of  x-rays  indicative  of  a  two-dimensional  superlattice  type  of  modulation  of 
the  layers  with  periodicities  of  ~  70  A  in  the  plane  of  the  layer  and  a  periodicity  of  ~  4 
layers  along  the  z-direction  (figure  5).  The  activation  energy  of  the  low  frequency 


Figure  5.  Schematic  diagram  of  the  structure  of  the  reentrant  smectic  phase.  The  arrows 
represent  the  cyano-end  groups  of  the  molecules. 
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dielectric  dispersion  in  the  phase  is  nearly  twice  that  of  the  bilayer  A  phase 
(Benguigui  and  Hardouin  1981).  Further,  the  phase  has  fluctuations  of  both  A^  and 
Ar  phases,  as  revealed  by  x-ray  and  dielectric  studies. 

Most  of  the  compounds  exhibiting  the  reentrant  phases  have  three  benzene  rings 
with  some  bridging  groups  in  between  in  the  aromatic  core  excepting  two  compounds 
which  have  only  two  benzene  rings  (see  table  1).  The  length  of  the  core  appears  to  play 
an  important  role  and  Tinh  (1983)  has  concluded  that  the  core  has  to  have  a  length  lying 
between  16  A  and  22  A  for  the  compounds  to  exhibit  the  reentrant  phase. 

There  have  been  some  attempts  to  develop  a  theory  for  the  occurrence  of  the 
phase  on  the  basis  of  the  antiparallel  orientations  of  neighbouring  molecules.  Assuming 
that  associated  pairs  are  formed  in  the  medium,  Cladis  (1979)  argued  that  as  the 
temperature  decreases  (or  equivalently  the  pressure  increases)  the  stability  as  well  as  the 
number  of  such  pairs  should  increase.  The  associated  pairs  are  obviously  much  less 
polar  than  the  individual  molecules.  If  it  is  now  assumed  that  the  lateral  interactions 
leading  to  the  formation  of  layers  are  greater  if  the  molecules  are  effectively  more  polar; 
it  is  clear  that  the  smectic  phase  stability  is  decreased  as  the  pairing  is  more  efficient. 
Further,  for  the  aromatic  part  of  the  molecular  pairs,  it  will  not  be  sterically 
advantageous  to  remain  in  the  layers  beyond  a  certain  density.  Equivalently,  one  can 
assume  that  there  is  an  optimum  density  for  the  stability  of  the  smectic  A  phase  of  the 
bilayer  kind.  Pershan  and  Prost  (1979)  have  used  this  idea  in  developing  a  Landau 
theory  of  the  reentrant  nematic-bilayer  smectic  A  transition.  By  adding  a  term  of  the 
form  g{p)  1 1// to  the  usual  Landau  theory  of  the  A-N  transition  where  1 |  is  the  smectic 
order  parameter,  and  expanding  g{p)  to  the  leading  power  in  {p  —  p^)  where  pQ  is  the 
optimum  density,  they  can  account  for  both  A-N  and  A-N  ^  transitions.  Further,  using 
this  model,  they  can  account  for  the  experimental  observation  that  the  bilayer  A  phase 
is  bounded  by  the  N  phase  in  the  PT  plane  (Cladis  et  al  1977;  for  a  more  detailed 
account  of  the  high  pressure  studies,  see  the  article  by  Shashidhar  1983).  Indeed  the  A 
phase  is  also  bounded  in  phase  diagrams  of  binary  mixtures  of  two  cyano  compounds,  if 
only  one  of  the  components  exhibits  the  N^  phase  (figure  6).  This  can  also  be  accounted 
for  by  using  the  Landau  model  if  g  is  taken  to  be  a  function  of  both  density  and 
composition  (Pershan  and  Prost  1979).  Thermodynamic  models  to  explain  such  non¬ 
linear  phase  diagrams  have  also  been  developed  (Clark  1979;  Klug  and  Whalley  1979; 
Heppke  and  Schneider  1979;  Billard  1979). 

Longa  and  de  Jeu  (1982)  have  given  a  detailed  molecular  theory  of  the  AN^ 
transition.  By  treating  the  system  as  a  mixture  of  monomers  and  dimers  interacting 
through  both  attractive  and  hard-core  repulsive  interactions,  they  essentially  work  out 
the  requirements  for  the  occurrence  of  the  optimum  density  that  was  referred  to  in  the 
previous  paragraph.  They  show,  using  the  framework  of  the  McMillan  model  of  the  A 
phase,  that  unfavourable  packing  of  the  dimers  in  planes  leads  to  N,^  phase  for  d/l 
~  1-3-1-4.  We  might  also  mention  here  a  paper  by  Hida  (1981)  who  has  developed  a 
lattice  theory  of  the  AN  ^  transition  by  assuming  that  dimer  pairs  occupy  an  fee  lattice. 
Allowing  for  vacancies  in  such  a  lattice,  he  shows  that  reentrance  is  possible  for  certain 
model  parameters. 

Another  model  for  the  occurrence  of  the  reentrant  phases  was  proposed  by  Berker 
and  Walker  (1981).  It  is  based  on  the  competition  between  short-range  positional  order 
in  the  layers  and  long-range  antiferroelectric  order  which  arises  due  to  interactions 
between  the  dipole  moments.  This  is  called  the  ‘frustrated  spin-gas’  model,  since,  as  we 
mentioned  earlier,  frustration  is  inherent  in  a  triangular  (or  equivalently  hexagonal) 
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Figure  6.  Phase  diagram  of  the  binary  mixtures  of  4-cyano-benzylidene-4'-alkoxyanilines. 
The  lines  are  calculated  on  the  basis  of  a  thermodynamic  theory  (Heppke  and  Schneider  1 979). 


a 


ooo 


Figure  7.  The  frustrated  spin  model  for  the  occurrence  of  Nj^  phase.  A  triangular  lattice  (b)  in 
which  two  bonds  are  stronger  than  the  third  can  form  a  network.  As  the  density  is  increased, 
the  molecules  would  like  to  be  in  a  close  packed  triangular  lattice  (a),  thus  bringing  in 
‘frustration’  and  hence  a  transition  to  the  phase. 


close  packing  of  the  molecules  in  the  layers:  if  two  molecules  have  already  formed  an 
antiparallel  pair,  a  third  molecule  which  would  try  to  sit  on  a  triangular  lattice  with 
respect  to  the  first  two  would  be  subjected  to  ‘frustration’  in  respect  of  its  orientation. 
The  authors  propose  that  this  can  be  relieved  by  packing  the  molecules  in  a  triangular 
lattice  such  that  there  are  two  strong  bonds  and  a  weak  bond,  thus  leading  to  the 
formation  of  a  network  (figure  7),  and  this  obviously  favours  the  formation  of  a 
(bilayer)  smectic  A  phase.  As  the  density  is  increased,  the  molecules  have  to  come  closer, 
and  pack  in  a  regular  triangular  lattice.  Frustration  effects  will  destroy  the  network  and 
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the  system  goes  over  to  the  re-entrant  nematic  phase.  By  assuming  attractive 
interactions  between  the  alkyl  chains,  a  reentrant  monolayer  smectic  A  phase  is 
predicted  to  occur  with  a  ferroelectric  order  in  the  layers.  However,  it  may  be  pointed 
out  that  according  to  this  model,  the  bilayer  would  have  a  long  range  antiferroelectric 
order  in  the  networks,  and  this  should  have  profoundly  influenced  both  the  static  and 
dynamic  parts  of  dielectric  anisotropy.  The  measurements  mentioned  earlier  do  not 
appear  to  support  this  model. 

Frost  (1980)  points  out  that  the  reentrant  nematic  phase  occurring  between  the 
bilayer  and  monolayer  A  phases  has  fluctuations  of  both  types;  and  it  is  thus  a  result  of 
the  mutual  exclusion  of  the  two  types  of  smectic  phases  which  are  competing  for 
condensation.  The  reentrant  phenomenon  in  this  model  comes  out  as  one  of  the 
possibilities  in  a  system  in  which  two  lengths  are  important,  the  other  possibilities  being 
different  modifications  of  the  smectic  A  phase  which  will  be  discussed  in  the  next 
section. 

On  the  experimental  side,  some  of  the  more  recent  findings  include  a  sequence 
NACNr  (Tinh  et  al  1982b;  The  Halle  group  has  found  many  mixtures  with  such  a 
sequence,  see  Pelzl  et  al  1981),  a  (tilted)  smectic  C  phase  appearing  between  the  A  and 
Nr  phases  (see  table  1)  and  a  more  exotic  triple  reentrant  behaviour  which  will  be 
referred  to  in  the  next  section. 


3.  Polymorphism  of  smectic  A 

X-ray  investigations  of  compounds  with  cyano  end  groups  have  shown  various  other 
remarkable  phenomena  in  recent  years.  For  example,  in  pentyl  cyanoterphenyl,  a 
compound  which  exhibits  only  a  nematic  phase,  short-range  correlations  correspond¬ 
ing  to  two  incommensurate  wavelengths  (at  ~  /and  ~  1-5  /)  have  been  found  (Brownsey 
and  Leadbetter  1980),  just  as  in  the  Nr  phase  separating  A  and  Ar  phases  discussed  in 
the  previous  section. 

4-n-Pentylphenyl-4'-(4"-cyanobenzoyloxy)benzoate  (hereafter  called  DBS,  see 
table  2)  has  two  ester  linkage  groups  in  the  aromatic  core  which  are  arranged  such  that 
their  dipole  moments  are  essentially  antiparallel  to  the  dipole  moment  of  the  cyano  end 
group.  This  compound  has  an  NA  transition  which  occurs  at  ~  110°  below  T^^,,  and 
according  to  the  McMillan  criterion  (McMillan  1971)  should  have  had  only  a  second 
order  NA  transition.  Actually,  however,  it  has  a  moderately  large  first  order  NA 
transition.  Meyer  and  Lubensky  (1976)  accounted  for  this  by  developing  a  mean  field 
theory  of  the  A  phase  with  a  couping  between  the  first  and  second  translational  order 
parameters.  Indeed  as  we  shall  see  presently,  DBS  and  similar  compounds  have  been 
found  to  possess  a  bilayer  spacing  which  is  close  to  twice  the  molecular  length,  which  in 
turn  give  rise  to  a  rich  variety  of  smectic  A  polymorphism. 

The  first  direct  evidence  for  the  coupling  between  the  first  and  second  fourier 
components  of  the  density  wave  was  obtained  in  a  mixture  of  DBS  with  a  weakly  polar 
compound,  viz  terephthal-6/s-butylaniline  (tbba)  which  exhibits  a  monolayer  A  phase 
(Sigaud  et  al  1979).  Calorimetric  and  x-ray  investigations  clearly  indicate  a  first  order 
transition  from  a  monolayer  phase  (AJ  to  a  bilayer  phase  whose  layer  spacing  is  ~  2  / 
(A2  phase).  Figure  8  shows  the  proposed  molecular  arrangement  in  the  two  phases. 
Pure  DB7,  a  higher  homologue  DBS  (see  table  2)  exhibits  at  higher  temperatures  of  its 
A  phase  a  layer  spacing  of  ~  1-6  /  with  a  coexisting  diffuse  spot  corresponding  to  ~  /. 
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Table  2. 


COMPOUND  or  MIXTURE 

SEQUENCE  OF  TRANSITIONS 

^ — O.OC  — — O.OC  — — CN 

4-n-  pentylphenyl— 4-'(  4  -cyanobenzoyloxy) benzoate  (DB5} 

-h 

C4H9  — — N=CH — CH=N — 

83.4  7.' 

,  Nt34Ai117A2 

terephthal  -  bis  (-  p  -butylonilme  )  {  TBBA) 

16.67. 

^ — O-OC — ^ — OOC— ^ — CN  1DB71 

I  233  N172Ad168A2 

nC5H„-(^>-CH=CH-<g>-0.0C  — 0-CN 

4-  ( 4-n-pentylstytyi )  phenyl  -  4'-  cyanobenzooteCS  stilbene) 

46  7. '' 

-f 

N  145  Ai  125  A  116  A2 

DBS 

54  7.  _ 

nC7Hi5  —0-O.OC  O.OC  — (g)-  ^^2 

4  —  n  -heptyt phenyl  —4 -  nitrobenzoyloxy  benzoate 

N95  A,  90A 

(  DB7N02) 

'^^10^^21 ^ ^ — ’^•0^  — (0^ — 

'CH3 

4  -n-decylphenyt  -3- methyl  -4  -  (4''-cyanoben2oybxy) 

N139.5  Ad  124.5  A2 

benzoate  (  10  PMCBB) 

— ^5) — O-OC — — O.OC — — NO2 

1224N195  Adl56NRl38Ad- 

(DB90N02  ) 

127Nr  124Ai  1185?  96C2 

illM 

V 1 1  ri 
11111 
I  111  1 
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Figure  8.  The  molecular  arrangement  in  (a)  the  Aj  phase,  and  (b)  the  A2  phase. 


This  phase  with  I  <  d  <21  is  called  the  Aj  phase.  (All  the  bilayer  A  phases  of 
compounds  exhibiting  reentrant  nematic  phases  belong  to  this  class.)  As  the 
temperature  is  lowered,  a  phase  transition  occurs  such  that  in  the  lower  temperature 
phase,  there  are  two  Bragg  spots  corresponding  to  d  ~  /  and  d  ~  21,  i.e.  there  is  a 
commensurate  lock-in  of  the  two  wavelengths  in  the  lower  temperature  A2  phase. 

Yet  another  type  of  smectic  A  (known  as  smectic  antiphase.  A)  is  found  in  some  pure 
compounds  and  mixtures.  For  example,  some  mixtures  of  DBS  and  C5  stilbene 
(table  2)  show  a  high  temperature  Aj  phase  (with  d  =  2n/q2  ^  0  with  a  diffuse 
scattering  at  wave  vector  which  is  split  such  that  qilqu  —  2-  On  cooling  this 
compound,  a  phase  transition  occurs  such  that  ^2  is  not  altered  but  diffuse  maxima 
corresponding  to  qi  get  condensed.  As  a  result,  a  superlattice  long  range  order  with 
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~  130  A  develops  in  the  lateral  direction  (figure  9).  Cooling  the  sample  further, 
yet  another  transition  takes  place  such  that  a  collinear  commensurate  locking  of  the 
two  wavelengths  occurs  along  the  z-direction.  In  other  words,  the  lateral  superlattice 
collapses  in  favour  of  an  Aj  phase.  Thus  the  sequence  of  transitions  on  cooling  this 
mixture  from  the  nematic  phase  is  N-Aj-A-Aj.  Some  pure  compounds  are  also  now 
known  to  exhibit  the  A  phase  (table  2). 

We  have  recently  studied  many  compounds  with  the  basic  chemical  structure 
similar  to  that  of  the  DBn  series,  except  for  the  presence  of  a  lateral  methyl  or 
methoxy  substitqent.  Indeed  the  resulting  4-cyanophenyl-3'-methyl-4'(4"-n-alkyl- 
benzoyloxy)benzoates  (called  mpmcbb)  are  obtained  by  interchanging  the  two  end 
groups  of  ncPMBB  compounds,  some  of  which  exhibit  the  phase  (tables  1  and  2).  10 
PMCBB  exhibits  a  large  increase  of  its  bilayer  spacing  as  the  temperature  is  lowered,  with 
a  jump  of  ~  0-4  A  at  —  T  ~  15°  (figure  10)  (Madhusudana  et  al  1982b).  The  second 
order  reflection  is  hardly  seen  in  the  high  temperature  phase  but  shows  up  strongly  in 
the  lower  temperature  A  phase  and  the  transition  corresponds  to  an  Aj-A2  transition. 
The  physical  interpretation  (Madhusudana  et  al  1982c)  of  this  phenomenon  can  be 
understood  from  figure  11.  In  12  cpmbb,  which  exhibits  the  N,^  phase,  the  two  ester 
group  dipole  moments  are  parallel  to  the  dipole  moment  of  the  cyano  group,  and 
neighbouring  molecules  are  arranged  as  in  figure  11a,  to  minimise  the  mutual 
interaction  energy  arising  from  dispersion  as  well  as  dipolar  forces.  On  the  other  hand 
in  12  PMCBB,  such  an  arrangement  (figure  1  lb)  is  not  likely  to  lead  to  a  minimum  in  the 
interaction  energy.  Though  mutual  dispersion  interactions  should  favour  this  configur¬ 
ation,  it  is  clear  that  the  dipolar  energy  between  the  cyano  group  and  one  of  the  ester 
groups  of  the  neighbouring  molecule  would  be  repulsive.  The  molecules  would  then 
prefer  the  arrangement  shown  in  figure  1  Ic  so  as  to  make  the  dipolar  interaction  energy 
highly  negative.  Indeed  this  energy  ( ~  is  a  few  times  the  thermal  energy  /c^T. 
However,  this  interaction  is  confined  to  one  end  of  the  molecule  and  the  structure  is 


Figure  9.  Schematic  diagram  of  the  arrangement  of  molecules  in  a  smectic  antiphase  (A). 
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Figure  10.  Temperature  variation  of  the  layer  spacing  of  10  pmcbb.  The  Aj-Aj  transition 
occurs  with  a  jump  in  the  layer  spacing  at  the  temperature  corresponding  to  the  dashed  line. 
The  region  around  this  transition  point  is  shown  on  a  magnified  scale  in  the  inset. 
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(O)  (b)  (C) 

Figure  11.  Schematic  diagrams  showing  the  disposition  of  various  dipolar  groups  of  a  pair 
of  (a)  «-cPMBB,  (b)  n-PMCBB  molecules  with  an  overlap  of  the  aromatic  cores  and  (c)  a  pair  of 
n-PMCBB  molecules  with  an  overlap  of  the  polar  end  groups. 
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hence  rather  fragile,  especially  since  the  molecules  have  a  relatively  long  alkyl  chain  and 
the  lateral  methyl  group.  As  the  temperature  is  raised,  the  structure  easily  breaks  up  and 
more  molecules  form  associations  as  in  figure  lib  which  should  still  have  a  negative 
interaction  energy.  As  a  consequence  the  layer  spacing  rapidly  decreases  with  increase 
of  temperature.  If  the  energy  difference  between  the  arrangements  of  figures  lib  and 
1  Ic  is  w,  the  number  of  molecules  having  configuration  c  is  oc  exp  (  —  w//cT]  and  one 
can  calculate  the  temperature  variation  of  the  layer  spacing  (Guillon  and  Skoulios 
1983). 

If  we  replace  the  cyano  end  group  of  10  pmcbb  with  a  nitro  group,  the  trend  of  the 
thermal  variation  of  the  layer  spacing  becomes  somewhat  different  (figure  12; 
Madhusudana  et  al  1982a).  The  sign  of  the  curvature  of  the  variation  is  opposite  to  that 
of  10  PMCBB  (figure  10)  and  further,  there  is  no  phase  transition  in  the  smectic  range. 
The  longitudinal  component  of  the  dipole  moment  of  the  NO2  group  is  practically  the 
same  as  that  of  CN  group.  However  the  lateral  components  of  NO  bonds  should  make 
the  mutual  interactions  in  configuration  c  of  figure  1 1  somewhat  repulsive  and  hence 
lead  to  the  difference  in  the  trend  of  layer  spacing  variation. 

The  dielectric  properties  of  such  compounds  also  are  in  conformity  with  the  model 
given  above.  For  example,  as  shown  in  figure  13,  the  dielectric  anisotropy  changes  sign, 
becoming  negative  at  lower  temperatures  of  the  A  phase  of  10  pmnbb,  as  more  and  more 
molecules  take  the  configuration  of  figure  1  Ic,  and  hence  the  contribution  of  the  strong 
end  group  dipole  moment  to  the  dielectric  constant  parallel  to  the  director  is  reduced. 

Figure  14  gives  a  schematic  diagram  of  the  expected  x-ray  diffraction  pattern  for 
different  modifications  of  the  A  phase.  A  comprehensive  theory  to  describe  the  various 
polymorphic  forms  of  the  A  phase  and  transitions  between  them  has  been  developed  by 
Frost  (1980, 1981).  He  has  written  down  a  general  Landau  free  energy  expression  with 
two  coupled  order  parameters  which  describe  the  tendency  of  the  molecules  to 
condense  at  two  (generally  incommensurate)  wavelengths.  The  variety  of  A  phases 
observed  is  a  consequence  of  the  competition  between  the  two  wavelengths.  According 
to  this  model,  each  of  these  A  phases  is  in  effect  a  ‘frustrated’  smectic,  in  which  the 
competition  between  the  two  wavelengths  is  sorted  out  in  a  specific  manner.  Thus,  for 
instance,  the  A  phase  (figure  9)  is  essentially  an  escape  from  incommensurability  when 


Figure  12.  Temperature  variation  of  the  layer  spacing  of  10  pmnbb. 
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Figure  13.  Temperature  variations  of  the  dielectric  constants  and  anisotropy  of  10  pmnbb. 
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Figure  14.  Schematic  diagram  of  the  expected  x-ray  diffraction  spots  from  different 
modifications  of  smectic  A  phase  (Levelut  et  al  1981). 


it  is  too  large,  i.e.  /'  is  much  different  from  2  /.  Then  the  dipolar  heads  switch  from  one 
layer  to  another  periodically,  thus  developing  a  transverse  two-dimensional  order.  As 
we  saw  at  the  end  of  the  previous  section.  Frost  has  also  described  the  reentrant 
phenomenon  (NA^NgAi)  in  the  same  framework,  but  including  the  effect  of 
fluctuations.  We  end  this  section  by  ppinting  out  a  remarkable  result  recently  found  by 
Tinh  et  al  (1982a)  in  a  compound  similar  to  the  DBn  series,  except  that  the  end  cyano 
group  is  replaced  by  a  nitro  group.  4-nonyloxyphenyl-4'-nitro-benzoyloxy  benzoate 
(table  2)  exhibits  the  following  sequence  on  cooling: 


INA,N«A,rN,A,S(?)C2. 
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Though  many  other  compounds  with  similar  structure  exhibit  only  A2  and  A  phases, 
this  particular  compound  with  a  nonyloxy  end  chain  rather  unexpectedly  shows  two 
reentrant  nematic  phases. 

4.  Induced  smectic  phases 

When  two  nematogenic  compounds  are  mixed,  one  would  expect  that  the  mixtures  also 
exhibit  only  the  nematic  mesophase.  However,  when  one  of  the  components  has  the 
strongly  polar  cyano  or  nitro  group  and  the  other  component  has  no  such  end  group,  in 
certain  composition  ranges  the  mixture  actually  exhibits  an  ‘induced’  smectic  phase 
which  is  usually  of  the  A  type  (Dave  et  al  1966;  Pelzl  et  al  1968;  Park  et  al  1975).  Such  a 
phase  diagram  is  displayed  in  figure  15  (Moodithaya  and  Madhusudana  1979).  Thus 
permanent  dipole-induced  dipole  interactions  appear  to  be  important  for  the 
occurrence  of  the  induced  phase.  Further,  there  is  experimental  evidence  that  a  charge 
transfer  complex  formation  takes  place  between  the  molecules  of  the  two  kinds 
(Sharma  et  al  1980),  the  highly  polar  component  acting  as  an  acceptor  while  the  other 
component  acts  like  a  donor.  Such  an  interaction  appears  to  lead  to  the  formation  of  a 
layered  arrangement  characteristic  of  smectic  phases.  (In  some  cases  even  the  smectic  E 
phase  which  has  orthorhombic  symmetry  is  induced,  see  for  example,  Schneider  and 
Sharma  1981). 

If  the  highly  polar  compound  itself  exhibits  a  bilayer  A^  phase,  then  the  mixtures  of 
such  a  compound  with  some  weakly  polar  compounds  exhibit  a  maximum  as  well  as  a 
minimum  in  the  A-N  transition  boundary  (Srikanta  and  Madhusudana  1982).  Figure 
16  illustrates  such  a  phase  diagram.  The  maximum  which  occurs  at  ~  50  mole  %  of  the 
two  components,  corresponds  to  the  induced  A  phase.  It  is  obvious  from  the  occurrence 


Figure  15.  Phase  diagram  of  mixtures  of  (2-hydroxy )-p-ethoxy-benzylidene-p'-butylaniline 
(ohebba)  with  7CB. 
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Figure  16.  Phase  diagram  of  mixtures  of  p'-nitrophenyl-p-n-octyloxybenzoate  (npoob)  and 
OH-EBBA. 


of  the  minimum  in  the  AN  transition  boundary  that  the  induced  A  phase  is  essentially 
incompatible  with  the  bilayer  A^  phase  of  the  highly  polar  component.  Indeed  x-ray 
studies  show  that  for  compositions  rich  in  the  weakly  polar  component,  the  layer 
spacing  corresponds  to  an  appropriate  average  molecular  length  of  the  two  com¬ 
ponents  (Engelen  et  al  1979;  Srikanta  and  Madhusudana  1982).  In  other  words,  it  is  an 
Ai  phase. 

The  layer  spacing  in  the  A^  phase  of  npoob  is  1-2  /.  For  compositions  between  the 
minimum  and  maximum,  the  medium  has  a  mixture  of  monolayer  and  bilayer  species. 
This  coexistence  appears  to  smear  out  any  possible  A^-Ai  transition  which  could  in 
principle  occur  near  the  minimum  of  the  AN  boundary.  However,  as  the  minimum  is 
approached  from  either  side,  the  thermal  expansion  coefficient  of  the  layer  spacing 
increases  significantly  (figure  17).  We  can  understand  this  by  assuming  that  the 
interaction  energy  between  the  strongly  polar  and  weakly  polar  components  (including 
contributions  from  both  dipole-induced  dipole  interactions  and  charge  transfer 
complex  formation)  is  only  slightly  higher  than  the  interaction  energy  which  gives  rise 
to  antiparallel  correlations  between  two  neighbouring  npoob  molecules.  Consequently, 
as  the  temperature  is  increased,  the  strongly  polar- weakly  polar  pairs  break  up  allowing 
NPOOB  molecules  to  form  more  antiparallel  pairs  (We  note  that  at  the  composition 
corresponding  to  the  minimum  in  the  AN  boundary,  there  are  3  npoob  molecules  for 
every  OH-EBBA  molecule).  This  naturally  leads  to  a  considerable  thermal  expansion  of 
the  layers.  The  dielectric  studies  on  the  same  system  confirm  this  mechanism  (Srikanta 
and  Madhusudana  1982).  Thus  this  system  also  manifests  the  effect  of  a  competition 
between  two  characteristic  lengths  at  the  molecular  level. 

In  conclusion,  strong  longitudinal  dipole  moments  in  rod-like  molecules  give  rise  to 
an  antiparallel  correlation  between  neighbouring  molecules.  Depending  on  the  detailed 
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Figure  17.  The  thermal  expansion  coefficients  of  the  layer  spacings  of  various  mixtures  of 
NPOOB  with  OH-EBBA  at  —  T  =;  2-5°. 

chemical  structure  of  the  molecules,  this  in  turn  gives  rise  to  a  ‘bilayer’  of  a  thickness 
which  lies  between  1  and  2  molecular  lengths.  The  coupling  between  this  length  and  the 
molecular  length  which  would  usually  characterize  a  density  wave  in  the  medium  gives 
rise  to  a  variety  of  interesting  phenomena  in  the  smectic  phases  formed  by  such  systems. 
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XANES  and  exafs  of  copper  compounds:  Studies  of  copper 
carboxylates  with  metal-metal  bonds  and  of  the  complex  formed 
by  Pseudomonas  aeruginosa^ 


P  R  SARODE,  G  SANKAR  and  C  N  R  RAO* 

Solid  State  and  Structural  Chemistry  Unit,  Indian  Institute  of  Science,  Bangalore  560012, 
India. 

Abstract.  X-ray  absorpion  near  edge  structure  (xanes)  of  copper  compounds  with  copper  in 
1^,  2^  and  3'*'  states  has  been  studied.  Extended  x-ray  absorption  fine  structure  (exafs)  has 
been  employed  to  determine  bond  distances  and  coordination  numbers  in  several  model 
copper  compounds.  Employing  both  xanes  and  exafs,  the  structure  of  the  copper  complex 
formed  by  the  micro-organism  Pseudomonas  aeruginosa  has  been  shown  to  be  square-planar 
with  the  Cu-O  distance  close  to  that  in  cupric  glucuronates  and  cupric  acetylacetonate.  exafs 
has  been  shown  to  be  useful  for  studying  metal-metal  bonds  in  copper  carboxylates. 

Keywords,  xanes;  exafs;  copper  compounds;  x-ray  spectra. 


1.  Introduction 

X-ray  absorption  near  edge  structure  (xanes)  and  extended  x-ray  absorption  fine 
structure  (exafs)  have  become  powerful  tools  for  studies  in  elucidating  the  structure  of 
various  compounds  (Belli  et  al  1980;  Parthasarathy  et  al  1982;  Sankar  et  al  1983).  xanes 
provides  information  about  the  nature  of  coordination  in  metal  compounds,  while 
exafs  gives  bond  distances  besides  the  coordination  numbers.  These  tools  are  especially 
useful  for  investigating  biological  compounds,  amorphous  materials  and  other  systems 
which  may  not  be  available  in  the  form  of  single  crystals  for  crystallographic  studies.  In 
this  paper  we  describe  the  results  of  our  investigation  of  copper  compounds  employing 
both  xanes  and  exafs.  After  having  verified  the  results  from  these  techniques  with 
respect  to  model  compounds,  we  have  used  the  techniques  to  elucidate  the  structure  of 
copper  complex  formed  from  the  slime  of  the  Pseudomonas  aeruginosa.  This  micro¬ 
organism  accumulates  copper  to  a  considerable  extent  and  little  is  known  about  the 
actual  structure  of  the  copper  complex  so  formed  (Payne  et  al  1981).  We  have  also 
examined  the  use  of  exafs  to  study  copper  carboxylates  containing  metal-metal  bonds. 
These  studies  have  been  able  to  establish  square-planar  coordination  in  the  copper 
complex  formed  from  Pseudomonas  aeruginosa  and  also  provides  the  metal-oxygen 
bond  distances  in  the  complex  as  well  as  in  model  copper  glucuronates. 

2.  Experimental 

Analar  samples  of  CUSO4  ■  5H2O  and  CUCI2  •  2H2O  were  obtained  commercially.  All 
other  copper  (I)  and  (II)  compounds  were  prepared  by  standard  methods  (Brauer  1963). 


t  Communication  No.  228  from  the  Solid  State  and  Structural  Chemistry  Unit 
•To  whom  all  correspondence  should  be  addressed. 
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Ba2Cu205  was  prepared  by  the  method  of  Arjomand  and  Machin  (1975).  Cupric 
glucuronates  and  copper  complex  formed  from  Pseudomonas  aeruginosa  were 
prepared  by  the  methods  already  reported  (Payne  et  al  1981). 

X-ray  absorption  spectra  were  recorded  using  a  bent-crystal  spectrograph.  The 
absorbers  of  the  compounds  were  prepared  by  spreading  the  finely-powdered  specimen 
on  cellophane  adhesive  tape.  The  energy  scale  was  fixed  by  using  the  emission  lines  of 
tungsten  which  appeared  due  to  the  impurity  in  the  target.  A  Carl-Zeiss  MD  100 
densitometer  was  used  to  obtain  the  traces  of  these  photographically  recorded  spectra. 
In  order  to  obtain  finer  details  of  the  spectra,  200  x  magnification  scans  were  taken  for 
each  compound.  The  linear  reverse  dispersion  on  the  traces  in  each  case  was 
0-35  eV/mm.  The  uncertainty  in  the  edge  and  its  near  structure  at  about  50  eV  was 
±  0-5  eV  and  beyond  50  eV  it  was  about  1  eV.  We  have  been  able  to  observe  the  fine 
structure  beyond  the  edge  upto  about  ~  550  eV. 

2.1  Data  reduction  and  analysis 

To  extract  the  extended  x-ray  absorption  fine  structure  modulation,  y(E)  {=  (jx  —  go)! go, 
where  g  is  the  observed  absorption  coefficient  and  go  is  the  absorption  coefficient 
modulation  in  the  absence  of  neighbouring  atoms)  from  the  raw  absorption  spectra, 
background  subtraction  was  done  using  a  method  due  to  Lytle  et  al  (1975).  This 
involves  the  fitting  of  a  polynomial  (Victoreen  function)  to  the  pre-edge  portion  of  the 
raw  data  and  extrapolating  it  over  the  whole  range  of  the  absorption  data  and 
subtracting  it.  After  this  pre-edge  subtraction,  a  second  degree  polynomial  is  fitted 
through  the  centre  of  exafs  oscillations.  This  curve  is  assumed  to  be  the  go  vs  energy 
curve  and  subtracted  from  the  data  and  the  difference  is  then  divided  by  it  in  order  to  get 
{g  —  go)lgo-  This  procedure  gives  the  normalized  absorption  coefficient  as  a  function  of 
photon  energy.  For  structural  analysis,  the  normalized  fine  structure  was  obtained  as  a 
function  of  photoelectron  wavevector,  k,  (in  A“^),  defined  by  the  relation  k 
=  [0-262467  {E  —  £o)]  ^  eV,  where  E  is  the  photon  energy  and  Eo  the  threshold  energy 
at  the  absorption  edge.  The  data  thus  obtained  was  multiplied  by  the  Hanning  window 
function  (Via  et  al  1979)  in  order  to  minimize  the  termination  ripples.  Subsequently,  it 
was  multiplied  by  a  weighting  function,  k^,  to  compensate  for  amplitude  reduction  as  a 
function  of  k.  The  integrations  (integration  limits  /c  =  3-5  to  k  =  11-6A~^)  for  the 
Fourier  transforms,  both  for  the  real  and  imaginary  components,  were  done 
numerically  using  Simpson’s  rule  (Cramer  et  al  1978).  The  modulus  of  the  Fourier 
transform  was  obtained  as  a  square  root  of  the  sum  of  the  real  and  imaginary  parts. 

The  extended  x-ray  absorption  fine  structure  associated  with  the  X-absorption  edge 
was  shown  to  have  (in  terms  of  structural  parameters)  the  following  general  form  (Stern 
1974;  Ashley  and  Doniach  1975): 

Xik)  =  =  r  Z  ^  bxp  ( -  (Tjk^)  exp  ( -  2Rj/X) 

go  K-  j 

sm[2kRj  +  (pj{k)].  (1) 

In  this  expression,  Nj  is  the  number  of  scatterers  (/)  at  distance  Rj  from  the  absorber; 
fj{n,  k)  is  the  scatterer’s  electron  backscattering  amplitude;  is  the  total  phase  shift;  oj 
is  the  mean  square  deviation  of  Rj,  X  is  the  mean  free  path  and  k  is  the  photoelectron 
wave  vector.  A  separate  modulation  or  wave  in  the  absorption  coefficient  is  produced 
by  each  coordination  shell  of  atoms  of  approximately  the  same  distance  R^  from  the 
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absorbing  atom.  The  observed  fine  structure  results  from  the  sum  of  waves  from  the 
various  coordination  shells.  The  object  of  the  data  analysis  procedure  is  to  decompose 
the  observed  spectrum  into  its  component  waves  and  determine  the  phase  and 
amplitude  of  each  component.  This  in  turn  can  yield  information  about  the  distance  in 
the  coordination  shells  and  the  number  of  atoms  in  them. 

A  least-squares  curve  fitting  analysis  was  employed  to  elucidate  finer  details  of  the 
structure  of  compounds.  The  background  subtracted  absorption  data  were  first 
Fourier-transformed  and  then  the  region  of  interest  in  FT  plot  was  Fourier-filtered  to 
isolate  the  first  and  second  coordination  shells.  Six  parameter  fits  were  performed  on 
the  data  using  a  parametrized  form  of  equation  (1) 


Zparam  =  Q  ’  exp  (C, )  sin  (flo  +  ^1^  +  ^2  A)-  (2) 

Fits  were  done  by  least-squares  adjustment  of  parameters  Co,  C,,  C2,  Oq,  and  02 
using  a  general  optimization  technique  (Himmelblau  1972).  The  function  minimized  in 
the  fitting  procedure  is  given  by 


F  = 


expt 


^param  ^ 


N 


1/2 


(3) 


where  Xparam  is  Zhe  parametrized  exafs  function,  Xexpt  is  Fourier  filtered  exafs  function, 
N  is  the  number  of  data  points  in  the  fit  and  the  sum  is  over  the  number  of  data  points.  F 
is  a  measure  of  the  goodness  of  the  fit  and  is  lower  for  a  better  fit. 

The  curve  fitting  procedure  for  determining  the  distance  JR  to  a  shell  of  atoms  in  an 
unknown  compound  is  as  follows,  exafs  data  from  a  model  compound  whose  crystal 
structure  is  known  is  fitted  with  equation  (2)  by  adjusting  all  the  six  parameters.  The 
standard  parameters  Ci ,  C2,  Uq  and  02  are  then  used  as  fixed  constants  in  fitting  (2)  to 
data  from  the  unknown  compound  by  adjusting  only  Cq  and  .  If  the  phase  shift  (f){k) 
is  assumed  to  have  the  form  </)(Zc)  —  ao  +  Oik  +  a2/k,  then  comparison  of  the  theoretical 
equation  (1)  with  the  parametrized  equation  (2)  shows  that  =  2R  +  0.  The  linear 
phase  parameter  (f)  is  determined  from  the  standard  fit  since  R  is  known,  and  then  </>  is 
used  to  calculate  R  from  the  fit  of  the  unknown.  After  determining  R2  for  the  unknown, 
the  number  of  atoms  N  can  be  estimated  by  iV  =  (Co/Cos)(i?s//?)  where  N,  R  and  Cq 
refer  to  the  unknown  and  N^,  R^  and  refer  to  the  model  compound. 


3.  Results  and  discussion 

3.1  XANES  of  typical  copper  compounds 

Figures  1  and  2  show  plots  of  K-absorption  edges  of  representative  of  copper 
compounds  with  copper  in  1^,  2^^  and  3"^  states;  the  figure  also  includes  absorption 
edges  of  a  few  copper  complexes.  In  the  case  of  Cu^^  compounds  (figure  1)  we  are  able 
to  make  spectral  assignments  on  the  basis  of  the  assignments  available  for  other 
transition  metal  compounds  (Sankar  et  al  1983;  Shulman  et  al  1976).  The  small  pre-edge 
peak  at  about  8977-5  eV  is  due  to  the  vibronically  activated  Is  3^^  transition.  The  next 
feature  in  the  spectra  appearing  as  a  shoulder  is  attributed  to  the  forbidden  Is  4s 
transition,  which  appears  to  become  allowed  due  to  the  mixing  of  the  4p  and  4s  orbitals. 
The  highest  energy  peaks  in  the  spectra  arise  from  the  allowed  Is  ->  4p  transition  which 
ultimately  merges  into  the  continuum.  The  consistency  in  the  position  of  the  peaks  due 
to  the  Is  -+  3d  transition  is  noteworthy;  the  energies  of  Is  -+  4s  and  Is  4p  transitions 
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Energy(eV) 

Figure  1.  Copper  K-edge  absorption 
spectra  of  (a)  cupric  oxide;  (b)  copper 
propionate;  (c)  copper  citrate;  (d) 
sodium  copper  glucuronate;  (e)  barium 
copper  glucuronate;  (f)  magnesium 
copper  glucuronate;  (g)  copper  complex 
from  Pseudomonas  aeruginosa;  (h) 
cesium  copper  trichloride;  (i)  cupric 
chloride  dihydrate;  (j)  Ba2Cu205. 


Energy(eV) 

Figure  2.  Copper  K-edge  absorption 
spectra  of  CuFeS2;  CuCl,  CuBr  and  Cul. 


show  a  similar  behaviour.  Energies  of  these  three  electronic  transitions  for  several  Cu^ 
compounds  are  listed  in  table  1.  The  energy  range  for  the  ls^3d  transition  is 
8977-3-8978-3  eV  with  an  average  of  8977-8  eV.  The  standard  deviation  of  this  average 
for  the  Cu^"^  compounds  studied  is  only  0-6  eV.  The  energy  range  for  the  ls->4s 
transition  is  8985-8987  eV  with  an  average  8986  eV  and  a  standard  deviation  of  0-8  eV. 
The  average  energy  for  Is  4p  is  8996-5  eV. 

Absorption  edges  of  a  few  Cu^"^  compounds  are  shown  in  figure  2.  The  first  peak 
which  appears  as  a  shoulder,  in  the  spectrum  of  the  Cu^  ^  compounds  is  due  to  the  Is 
-♦4s  transition,  the  ls->-3d  transition  being  absent  in  this  system.  The 
ls-»-4s  transition  of  Cu^"^  comes  at  a  lower  energy  than  Cu^^.  The  energy  of  the 
Is -♦4s  transition  of  Cu^"^  is  8982-6  eV  and  this  does  not  overlap  with  either  the 
Is  -♦  3d  or  Is  4s  range  found  in  Cu^^  compounds,  thus  providing  a  clear  distinction 
between  the  edge  spectra  of  the  compounds  with  copper  in  the  two  different  oxidation 
states.  As  mentioned  earlier,  it  is  noteworthy  that  there  is  no  Is  -♦  3d  transition  in  Cu^ 
compounds  and  this  again  enables  the  distinction  between  Cu^"^  and  Cu^"^  spectra. 
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Table  1.  X-ray  K-absorption  edge  data  on  copper  compounds . 


Transition  energies  (eV) 


Compound 

Is -34 

Is  4s 

Is  —  4p 

Copper  ammonium  sulphate  monohydrate 

8978-0 

8984-3 

8994-0 

Cupric  oxide 

8978-2 

8985-0 

8995-6 

Anhydrous  copper  propionate 

8977-8 

8985-7 

8995-9 

Copper  sulphate  pentahydrate 

8977-4 

8986-5 

8996-6 

Copper  citrate  dihydrate 

8977-9 

8985-8 

8996-0 

Cesium  copper  trichloride 

8977-3 

8985-9 

8997-5 

Cupric  chloride  dihydrate 

8977-6 

8987-0 

8996-0 

Sodium  copper  glucuronate 

8977-5 

8985-1 

8995-3 

Barium  copper  glucuronate 

8977-4 

8985-0 

8995-4 

Magnesium  copper  glucuronate 

8977-4 

8985-1 

8995-1 

Copper  complex  formed  from  Pseudomonas  aeruginosa 

8978-3 

8985-3 

8995-5 

Differences  in  the  spectra  of  compounds  with  copper  in  the  2^  and  3"^  states  are 
small  compared  to  the  differences  between  the  spectra  of  Cu^  and  compounds. 
Ba2Cu205  containing  copper  in  the  Cu^"^  state  shows  a  pre-edge  transition  (figure  1) 
with  a  peak  at  8985  eV,  about  8  eV  above  the  highest  energy  found  in  Cu^"*^ 
compounds.  The  peak  position  of  the  Is  ->  4s  transition  is  found  at  8995  eV,  10  eV 
higher  than  that  of  Cu^  ^  compounds. 

3.2  EXAFS  of  typical  copper  compounds 

In  figure  3  we  show  background  subtracted  spectra  of  cupric  oxide,  anhydrous  copper 
propionate,  copper  ammonium  sulphate  monohydrate  and  copper  thiosalicylate.  In 
figure  4  we  have  shown  the  Fourier-transformed  spectra  in  i?-space  of  these 
compounds.  These  are  the  four-model  compounds  used  to  identify  near  neighbours  by 
determining  the  phase  shift  for  Cu-O,  Cu-Cu,  Cu-N  and  Cu-S  atom  pairs.  Except  for 
the  copper  propionate,  each  of  these  has  a  major  peak  in  the  ft  plot  (single-shell  model 
compounds)*.  This  intense  peak  is  isolated  using  a  suitable  window  function  (Teo  1981) 
by  retransforming  it  to  /c-space  and  then  fitted  with  (2)  by  adjusting  all  six  parameters 
Co,Ci,C2,ao,ai  and  <22 •  The  first  two  peaks  in  the  copper  propionate  transform  were 
retransformed  to  k-space  and  fitted  with  the  sum  of  two  waves  in  two-shell  fit.  The  first 
shell  contains  the  amplitude  and  phase  parameters  from  the  cupric  oxide  fit,  adjusting 
only  Co  and  ai ,  while  all  the  six  parameters  of  the  second  shell  corresponding  to  Cu-Cu 
atom  pair  were  adjusted.  Table  2  lists  the  results  of  the  fits  of  these  model  compounds 
including  the  fitting  function  value  from  (3),  and  the  parameters  Cq,  Cj,  C2,  a^,  a^,  and  02 
from  (2).  In  figure  5,  we  have  shown  the  fitted  and  experimentally  observed  (Fourier- 
filtered)  EXAFS  spectra  of  these  model  compounds.  The  curve-fitted  spectra  obtained 
from  the  parameterized  form  of  exafs  equation  matches  very  well  with  the  observed 
Fourier-filtered  spectra.  According  to  Citrin  et  al  (1976)  and  Cramer  et  al  (1978)  all 
these  phase  (oq,  a^,  02)  and  amplitude  (Co,  Cy,  C2)  parameters  can  be  transferred  to  the 


*  The  term  single  shell  refers  to  structures  where  the  absorbing  atom  is  surrounded  by  a  set  of  identical 
scatterers  at  nearly  equal  distances. 
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Figure  3.  Background-subtracted  exafs  spectra  of  four  model  compounds. 


system  in  which  the  absorbing  atoms  have  the  same  environments  to  predict  the 
structural  parameters.  As  a  first  test  of  transferability  of  these  Cu-X  parameters,  these 
phase  shifts  and  amplitudes  were  used  to  determine  other  crystallographically  known 
compounds.  The  compounds  chosen  for  this  test  were  copper  sulphate  pentahydrate, 
copper  acetylacetonate  and  copper  phthalocyanine  all  of  which  possess  square-planar 
co-ordination  (Bacon  and  Curry  1962;  Basu  et  al  1962;  Harrison  and  Assour  1964)  and 
bond  distances  comparable  to  those  in  CuO.  The  Fourier-transformed  spectra  for  these 
compounds  are  shown  in  figure  6.  To  obtain  structural  information  from  the  FT 
spectra  of  these  compounds,  the  observed  peak  position  is  simply  modified  by 
previously  determined  phase  shift  to  obtain  a  predicted  distance.  The  co-ordination 
number  is  then  obtained  from  the  observed  peak  height.  The  predicted  and 
crystallographically  observed  structural  results  are  summarized  in  table  3. 

For  curve-fitting  analysis  of  the  spectra,  Cq,  and  Ui  were  floated  while  Cj,  C2,  a^  and 
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Figure  4.  Fourier  transforms  of  the  four  model  copper  compounds.  Curves  shows  the 
magnitude  only.  Transform  range:  3-5-1 1-6  A” \  scaling. 


02  were  fixed.  The  two  parameters  floated  gave  bond  lengths  and  co-ordination 
numbers.  The  numerical  results  of  these  fits  are  presented  in  table  3.  We  see  that  the 
predicted  bond  distances  and  co-ordination  numbers  from  exafs  agree  with  those 
obtained  from  x-ray  diffraction  data.  The  close  correspondence  between  the  distances 
and  the  co-ordination  numbers  given  by  crystallography  and  exafs  of  the  same 
compounds  lends  confidence  to  our  ability  to  predict  co-ordinating  atoms  and 
distances  in  copper  compounds  of  unknown  structure. 

3.3  XANES  and  exafs  studies  of  copper  glucuronates  and  the  copper  complex  formed  by 
the  slime  of  Pseudomonas  aeruginosa 

Edge  spectra  for  sodium-copper  glucuronate,  barium  copper  glucuronate,  magnesium 
copper  glucuronate  and  a  copper  complex  isolated  from  the  slime  of  the  Pseudomonas 
aeruginosa  bacteria  are  shown  in  figure  1.  These  spectra  show  a  weak  peak  in  the  Cu^ 
Is 3d  region.  The  average  energy  of  the  Is  3d  peak  in  these  complexes  in  8977-6  eV. 

The  next  feature  due  to  Is  ->  4s  transition  appearing  as  a  shoulder  is  at  8985- 1  eV.  Both 
these  transition  energies  are  well  within  the  range  of  Cu^  ^  compounds  (see  table  1).  We 
can  therefore  conclude  that  copper  in  all  these  four  copper  complexes  are  in  Cu^  state. 
Accordingly  these  complexes  show  characteristic  esr  signals  due  to  Cu^  (Payne  et  al 
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Figure  5.  Fits  of  four  model  compounds.  The  solid  lines  are  least-squares  fits  and  the  dotted 
lines  are  the  Fourier  filtered  exafs  data. 


1981).  Furthermore,  the  Cu(2p3/2)  peaks  in  x-ray  photoelectron  spectra  of  these 
complexes  exhibit  characteristic  satellites  just  as  in  CuO  indicating  the  presence 
of  Cu^^  ions  (Payne  et  al  1981).  It  is  well-known  that  near-edge  structure  gives 
information  on  the  site  symmetry  of  absorbing  atoms  in  materials  (Belli  et  al  1980; 
Grunes  1983).  Since  all  the  spectral  features  in  the  above  complexes  are  essentially 
similar  to  those  of  Cu^  model  compounds  where  the  coordination  geometry  of  copper 
is  square-planar,  we  may  conclude  that  these  cupric  complexes  are  likely  to  have  square- 
planar  coordination.  Thus,  the  esr  spectra  of  these  copper  complexes  show  and  gj^ 
values  similar  to  cupric  acetylacetonate  (Payne  et  al  1981)  confirming  that  the 
complexes  are  square-planar. 
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Radial  dist-ancelA) 


Figure  6.  Fourier  transforms  of  copper  sulphate  pentahydrate,  copper  acetylacetonate  and 
copper  phthalocyanine. 


We  shall  now  discuss  the  results  of  exafs  analysis  of  these  complexes.  In  (1),  the  total 
phase  shift,  (f),  can  be  treated  (Cramer  and  Hodgson  1979)  as  the  sum  of  absorber,  (f>a, 
and  scatterer  phase,  shifts.  0(/c)  =  </><,(/()  +  0s(/c).  The  absorber  phase  shifts  (/»a(/c)  is  the 
result  of  the  photoelectron  propagating  out  of  and  back  through  the  central  atom 
potential.  Reflection  of  the  photoelectron  by  the  scattering  atom  potential  causes  the 
scatterer  phase  shift,  cj)^.  For  a  series  of  compounds  with  the  same  absorber,  differences 
in  total  phase  shift  are  caused  by  the  different  (f)^  of  the  scattering  atoms  involved.  The 
total  phase  shifts  obtained  from  the  experimentally  measured  exafs  data  as  a  function 
of  wavevector  are  used  to  predict  the  type  of  atoms  surrounding  the  absorbing  atom. 

In  the  present  investigation,  the  total  phase  shifts  were  obtained  by  subtracting  2Rk 
from  aQ  +  aik  +  a2/k  (for  copper  oxide,  copper  propionate,  copper  ammonium 
sulphate  and  copper  thiosalicylate),  k,  ranging  from  3-5  and  1 1-6  (A  “  ^).  In  figure  7  we 
have  plotted  these  empirical,  pair-wise  phase  shifts  for  Cu-O,  Cu-S,  Cu-N  and  Cu-Cu; 
the  curve  for  Cu-C  pair  could  not  be  given  since  we  do  not  have  a  model  compound 
with  carbon  neighbour  within  3A  range.  In  figure  8,  we  have  plotted  the  phase  shifts  for 
the  pairs  taken  from  theoretically  calculated  and  tabulated  data  of  Teo  and  Lee  (1979) 
and  Lee  et  al  (1977).  Back  scattering  phase  shifts  for  nitrogen  are  not  given  in  the  tables 
of  these  authors;  the  phase  shifts  for  the  Cu-N  atom  pair  (figure  8)  was  obtained  by 
interpolation  as  suggested  by  Teo  and  Lee  (1979).  We  see  from  figures  7  and  8  that 
Cu~0,  Cu-N,  Cu-Cu  and  Cu-S  phase  shifts  are  all  similar.  The  differences  in  phase 
shifts  for  O,  Cu,  N,  S,  permit  straightforward  differentiation  between  these  back- 
scattering  atoms.  In  figure  7,  we  have  shown  the  dotted  curve  for  the  copper  complex 


Table  3.  Calculation  of  single-shell  distances  and  coordination  numbers.  Model  compound;  CuO. 
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Figure  7.  Experimentally  determined  copper-backscatterer  phase  shifts  vs.  photoelectron 
wave  vector,  k.  In  has  been  subtracted  from  the  Cu-Cu  phase  shifts. 


Figure  8.  Theoretically  calculated  copper-backscatterer  phase  shifts  vs.  photoelectron  wave 
vector,  k.  2n  has  been  subtracted  from  the  Cu-Cu  phase  shifts. 
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formed  from  Pseudomonas  aeruginosa.  We  see  that  the  curve  overlaps  with  that  for  the 
Cu-O  atom  pair  once  again  establishing  that  copper  in  this  complex  has  near 
neighbour  oxygen  atoms  (in  the  first  co-ordination  shell);  synthetic  copper  glucuro- 
nates  also  exhibit  similar  curves.  After  identifying  the  type  of  surrounding  of  copper  in 
these  complexes,  we  will  now  determine  the  bond  distances  and  co-ordination  numbers 
from  EXAFS  data. 

In  figure  9  we  have  shown  the  Fourier  transform  spectra  of  sodium  copper 
glucuronate,  barium  copper  glucuronate,  magnesium  copper  glucuronate  and  complex 
obtained  from  Pseudomonas  aeruginosa.  The  ft  spectra  retransformed  to  /c-space  and 
fitted  to  the  parametrized  equation  (2)  varying  the  amplitude  parameter,  Cq  and  phase 
shift  parameter  Uj ,  while  the  other  four  parameters  were  fixed  constant.  The  results  of 
these  analysis  are  summarized  in  table  3.  As  expected  the  co-ordination  number  in  all 
these  complexes  is  four  establishing  thereby  that  they  are  all  square-planar.  We  readily 
see  that  the  distances  in  the  copper  glucuronates  and  in  the  complex  from  Pseudomonas 
aeruginosa  are  quite  similar  to  those  of  copper  acetylacetonate. 

Since  the  analysis  of  the  slime  produced  by  the  bacteria  Pseudomonas  aeruginosa 
without  copper  present,  using  hplc  (Magee  1981)  clearly  shows  that  glucuronic  acid  is  a 
major  constituent,  we  suggest  that  the  nature  of  the  complex  is  itself  copper 
glucuronate.  The  possible  structures  of  the  copper  complex  using  the  Haworth 
structure  for  the  uronic  acid  is  shown  in  chart  I.  In  structure  I,  copper  is  co-ordinated 


Figure  9.  Fourier  transforms  of  sodium  copper  glucuronate,  barium  copper  glucuronate, 
magnesium  copper  glucuronate  and  copper  complex  formed  from  Pseudomonas  aeruginosa. 


250 


P  R  Sarode,  G  Sankar  and  CNR  Rao 


H 


Chart  1. 


II 


through  the  ether  oxygen  of  carbon  1  and  the  aldehydic  oxygen.  In  structure  II,  copper 
is  coordinated  through  the  hydroxyl  oxygens  of  carbon  3  and  4. 

3.4  Carboxylates  with  Cu-Cu  bonding 

In  order  to  determine  whether  a  peak  in  the  Fourier-transformed  spectra  correspond¬ 
ing  to  backscattering  from  a  metal  is  sufficiently  strong  compared  to  those  due  to 
oxygen  neighbours  to  be  easily  distinguished  from  them,  exafs  data  were  collected  on 
few  compounds  with  various  metal-metal  distances.  Anhydrous  copper  propionate, 
copper  acetate  monohydrate,  copper  benzoate  trihydrate,  copper  citrate  dihydrate  and 
copper  salicylate  tetrahydrate  have  Cu-Cu  distances  of  2-578  (Simonov  and 
Malinovskii  1970),  2-614  (Brown  and  Chidambaram  1973),  3-15  (Inoue  eta/ 1965),  3-242 
(Mastropaolo  et  al  1976)  and  3-728  A  (Inoue  et  al  1964)  respectively.  The  Fourier- 
transformed  spectra  of  these  are  shown  in  figure  10.  In  this  figure  the  arrows  show 
peaks  corresponding  to  the  Cu-Cu  atom  pairs.  The  propionate  transform  shows  the 
Cu-Cu  peak  at  2-151  A  which  means  that  the  linear  phase  shift  in  R  space  for  Cu-Cu 
pair  is  0-427  A.  Applying  this  phase  shift  to  the  other  transforms  of  the  remaining  three 
compounds,  we  have  determined  the  Cu-Cu  distances  in  these  compounds.  These 
distances  are  given  in  table  4.  In  this  table  we  have  also  given  the  distances  and 
coordination  numbers  obtained  by  curve  fitting  the  data  to  equation  (2).  There  is  a  good 
agreement  between  the  distances  and  coordination  numbers  determined  from  Fourier 
transform  and  those  obtained  from  curve-fitting.  There  is  also  a  fair  agreement  between 


XANES  and  exafs  of  copper  compounds 


251 


Figure  10.  Fourier  transforms  of  anhydrous  copper  propionate,  copper  acetate  monohy¬ 
drate,  copper  benzoate  trihydrate,  copper  citrate  dihydrate  and  copper  salicylate  tetrahydrate. 


Table  4.  Calculation  of  two-shell  scatterer  distances  and  numbers  in  copper  carboxylates. 
Model  compound:  Cu(CH3CH2COO)2. 


Radial  distance  Coordination  number 

Value  of - — 

the  fitting  FT  dis-  Curve-fit  FT  Curve-fit 

Compound  function  Cq  Ui  tance  (A)  distance  (A)  number  number 


Copper  acetate  monohydrate 


(i)  4Cu-Oat  1-969  A 

0-75 

0-446 

3-525 

1-984 

1-980 

4-5 

4-2 

(ii)  ICu-Cu  at  2-614  A 

2-532 

5-070 

2-591 

2-595 

1-3 

1-2 

Copper  benzoate  trihydrate 

(i)  4Cu-Oat  1-94  A 

0-98 

0-418 

3-447 

1-950 

1-934 

4-3 

4-1 

(ii)  ICu-Cu  at  3-15  A 

0-980 

6-198 

3-180 

3-160 

08 

0-9 

Copper  citrate  dihydrate 

(i)  4Cu-0  at  1-946  A 

0-81 

0-403 

3-456 

1951 

1-947 

3-9 

3-9 

(ii)  ICu^u  at  3-242  A 

0-967 

6-390 

3-250 

3-240 

0-7 

08 

Copper  salicylate  tetrahydrate 

(i)  4Cu-0  at  1-92  A 

0-99 

0-402 

3-426 

1-94 

1-93 

3-9 

4-1 

(ii)  ICu-Cuat  3-728  A 

0-971 

7-56 

3-76 

3-74 

08 

09 
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these  data  and  the  crystallographic  data  reported  in  literature. 

Except  for  the  acetate  and  the  propionate  where  the  Cu-Cu  distance  is  less  than 
2-80  A,  there  is  no  strong  metal-metal  peak  in  the  Fourier  transform  of  the  benzoate  or 
the  citrate  or  the  salicylate.  Since  the  exafs  amplitude  is  strongly  dependent  on  the 
thermal  motion  of  the  backscatterers  with  respect  to  the  absorbing  atom,  it  is  possible 
that  at  lower  temperature  metal-metal  components  can  be  more  clearly  distinguished  in 
the  FT  spectra. 

Acknowledgement 

The  authors  thank  the  Department  of  Science  and  Technology,  Government  of  India 
for  support  of  this  research  and  Prof.  R  J  Magee  for  interesting  them  in  the  Cu  complex 
formed  by  Pseudomonas  aeruginosa. 


References 

Arjomand  M  and  Machin  D  J  1975  J.  Chem.  Soc.  Dalton  Trans.  1061 
Ashley  C  A  and  Doniach  S  1975  Phys.  Rev.  Bll  1279 
Asbrink  S  and  Norrby  L  J  1970  Acta  Crystallogr.  B26  8 
Bacon  G  E  and  Curry  N  A  1962  Proc.  R.  Soc.  (London)  A266  95 
Basu  G,  Belford  R  L  and  Dickerson  R  E  1962  Inorg.  Chem.  1  438 

Belli  M,  Scafati  A,  Bianconi  A,  Mobilio  S,  Palladino  L,  Reale  A  and  Burattini  E  1980  Solid  State  Commun.  35 
355 

Brauer  G  1963  Handbook  of  preparative  inorganic  chemistry  (New  York;  Academic  Press) 

Brown  G  M  and  Chidambaram  R  1973  Acta  Crystallogr.  B29  2393 
Citrin  P  H,  Kincaid  B  M  and  Eisenberger  P  1976  Phys.  Rev.  Lett.  36  1346 
Cramer  S  P  and  Hodgson  K  O  1979  Prog.  Inorg.  Chem.  25  1 

Cramer  S  P,  Eccles  T  K,  Kutzler  F,  Hodgson  K  O  and  Doniach  S  1976  J.  Am.  Chem.  Soc.  98  8059 

Cramer  S  P,  Hodgson  K  O,  Stiefel  E  I  and  Newton  W  E  1978  J.  Am.  Chem.  Soc.  100  2748 

Grunes  L  A  1983  Phys.  Rev.  B27  2111 
Harrison  S  E  and  Assour  J  M  1964  J.  Chem.  Phys.  40  365 

Himmelblau  D  M  1972  Applied  non-linear  programming  (New  York:  McGraw-Hill) 

Inoue  M,  Kishita  M  and  Kubo  M  1964  Inorg.  Chem.  3  239 

Inoue  M,  Kishita  M  and  Kubo  M  1965  Inorg.  Chem  4  626 

Lee  P  A,  Teo  B  K  and  Simons  A  L  1977  J.  Am.  Chem.  Soc.  99  3856 
Lytle  F  W,  Sayers  D  E  and  Stern  E  A  1975  Phys.  Rev.  Bll  4825 
Magee  R  J  1981  Personal  communication 

Mastropaolo  D,  Powers  D  A,  Potenza  J  A  and  Schugar  H  J  1976  Inorg.  Chem.  15  1444 
Mazzi  F  1955  Acta  Crystallogr.  8  137 

Parthasarathy  R.  Fqrode  P  R,  Rao  K  J  and  Rao  CNR  1982  Proc.  Indian  Natl.  Acad.  A48  119 
Payne  R,  Mu._-c  R  J,  Sarode  P  R  and  Rao  CNR  1981  Inorg.  Nucl.  Chem.  Lett.  17  125 
Sankar  G,  Sarode  P  R  and  Rao  CNR  1983  Chem.  Phys.  76  435 

Shulman  R  G,  Yaffet  Y,  Eisenberger  P  and  Blumberg  W  E  1976  Proc.  Natl.  Acad.  Sci.  USA  73  1384 
Simonov  Y  A  and  Malinovskii  T  I  1970  Sou.  Phys.  Cryst.  15  310. 

Srivastava  U  C  and  Nigam  H  L  1972  Curr.  Sci.  41  81 
Stern  E  1974  Phys.  Rev.  BIO  3207 

Teo  B  K  198 1  in  exafs  spectroscopy:  Techniques  and  applications  (eds)  B  K  Teo  and  D  C  Joy,  Chapter  3  (New 
York:  Plenum) 

Teo  B  K  and  Lee  P  A  1979  J.  Am.  Chem.  Soc.  101  2815 

Via  G  H,  Sinfelt  J  H  and  Lytle  F  W  1979  J.  Chem.  Phys.  71  690 


I 


^ 


'Ti-.-T^-*'r  ^7:' 
t  » '  - 


< 


V'. 


&4 


V  V" 


*  V 


■>'. 


K'  < 


r- 


,-  5*-.: 


'.  -f  - ? 

s 


■  ^  *1*  r 


':  -'t' 

»■  ?' 


r  » 

> 


rV_,  '  ■* 

■  if 


i.  « 


1 


■:f^ 

'  V 


> 

^  * 


■* 

V 


♦ 


V 


^  .■  '  I. 

c 


'•4 


T  / 

,5 


ft .,  * 

V 


4r‘ 


A« 


•'  t  ; 


•i . 


.*V 


«^... 


b? 


V  p 


r'^ ..- 


t'r' 

i-- 


ft 


^  ■ 

I  ^ 


4 


.« 

4 


<» 


;•  . 


-  Ti.- 


vv 


r  t 


-  'A 


V- 


V 

•* 


*•  -A 


t  ^ 


4 


/• 


vi' 


♦Sl 

f. 


<• 


«» 


♦'  .K** 


I  . 


Iff 


V  •  '  . 

»•  .  f 


«A 

4 


■  \\‘ 


♦ 


'■-  4..' 


ii  ! 

■  i 

M 


/ 


<«* 


i-  ••^  :  '  :• 

'  i  *■  ‘C 

V  f  ^-.-s 

A  '  •♦T  ■ 


1^* 

.  .< 

i 


T-X 


i 


r- 


»  * 


¥ 

S 


,4 


■■? 

r 


%  L.^  - 


'A-*  ^:^>-' 


:»b.  .f.V..  4>i&( 


..  -  ./»  •!- 


